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*•  I13B0CgCTI0N 

A.   EBCBIZH 

Current  military  policy  includes  an  expansion  of  the 
Navy  frcm  the  present  level  cf  approximately  500  shits  to 
approximately  600  ships  by  the  end  of  the  decade.  Ihis 
expacsicr  is  being  conducted  at  a  time  when  the  primary 
recruiting  pool,  17  -  21  year  old  males,  is  declining.  Ihere 
are  t»o  avenues  which  must  be  pursued  to  prevent  a  short 
fall  in  personnel  tc  man  the  expanding  fleet.  First,  the 
Navy  must  retain  high  guality  personnel  and  secondly,  the 
Navy  must  recruit  sufficient  numbers  of  high  guality 
personnel  tc  replace  attritions  and  simultaneously  m€€t  ever 
increasing  manpower  requirements. 

It  is  not  enough  just  to  recruit  warm  todies  tc  fill 
vacancies.  The  ability  cf  the  American  serviceman  to 
adeguately  operate  and  maintain  equipment  of  increasing 
sophistication  has  been  widely  debated  in  Congress  and  the 
press.  Stcries  abound  of  the  expected  deterioraticn  cf 
eguipment  readiness  during  periods  of  increased  tensions  in 
the  event  of  the  loss  of  civilian  personnel.  To  maintain  an 
effective  fcrce,  the  Armed  Services  must  be  able  to  operate 
and  maintain  its'  eguipment  independent  of  noncemtatant 
civilian  technicians.  In  order  to  achieve  this  goal, 
personnel  with  the  anility  to  learn  the  operation  and  main- 
tenance of  ever  mere  sophisticated  eguipment  must  be 
cbtained  and  retained. 

Cne  avenue  proposed  to  alleviate  a  manpower  shortfall  is 
an  expanded  role  for  women  in  the  Armed  Forces.  This  would 
seem,  en  the  surface,  to  be  a  viable  solution  to  the  problem 
since  wemen  have  been  historically  less  expensive  to  recruit 
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and  retain  than  iten  of  comparable  quality  [Bef.  1]. 
However,  the  Navy  is  precluded  from  taking  full  advantage  of 
this  source  of  manpower  due  to  public  law  precluding  the 
permanent  assignment  of  women  to  combatant  ships.  This 
requires  focusing  the  recruiting  effort  towards  high  guality 
nen-  Ihe  definition  of  "high  guality"  has  been  widely 
debated.  For  the  purposes  of  this  study,  "high  guality"  will 
be  defined  as  high  school  graduates  in  Mental  Groups  I 
through  III  upper  as  measured  by  test  scores  on  the  Aried 
Forces  Vocational  Battery  of  tests  (ASVAB)  . 

E.   JCCCS 

This  thesis  is  concerned  with  developing  a  model  to 
forecast  the  number  of  high  guality  nonprior  service  sales 
signing  contracts  en  the  recruiting  district  level.  Ihe 
dependent  variable  is  net  contracts  signed:  CNTRCTS.  Values 
of  this  variable  are  determined  by  subtracting  Delayed  Entry 
frogram  (BEF)  attrites  during  the  month  of  attrition  from 
contracts  signed  during  the  same  month. 

Three  approaches  to  forecasting  CN1BC1S  are  taken. 
Eirst,  the  Box-Jenkins  methodology  is  applied  to  a  mcrthly 
time  series  of  the  net  number  of  nonprior  service  high 
school  graduate  males  signing  contracts  in  the  Albuguergue 
New  fiexico  recruiting  district  during  the  period  Octorer 
1978  to  September  1983  (60  data  points).  This  methodology  is 
explained  in  detail  in  Appendix  A.  The  second  approach 
utilizes  "best  fit"  lagged  variables  in  a  block  regression 
format  to  provide  a  causal  model  using  economic  data,  the 
number  of  production  recruiters  and  military/civilian  pay 
ratio  as  explanitorj  variables.  The  third  procedure 
combines  the  causal  icdel  and  the  Box-Jenkins  methodology  in 
a  mixed  time  series/causal  model  to  attempt  to  provide  mere 
accurate  forecasts. 
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C.   1EE  IB7IB0NHENT 

Ifcis  thesis  will  limit  the  scope  of  study  to  a  single 
recruiting  district.  It  is  hypothesized  that  better  esti- 
mates of  the  effects  of  changing  economic  factors  and  the 
effects  cf  the  variation  in  the  number  of  production  recrui- 
ters kill  re  obtained  at  this  level  than  on  a  national 
scope.  During  any  change  in  the  general  trends  of  the 
nation's  economy,  local  conditions  are  apt  to  vary  consider- 
ably ficm  recruiting  area  to  recruiting  area.  It  may  be  that 
the  economic  trends  in  a  particular  recruiting  district  are 
in  direct  exposition  tc  those  on  the  national  level.  Rhile 
goals  and  guotas  are  specified  at  a  national  level,  recruit- 
Bent  cf  individuals  cccurs  at  the  district  level.  The  Navy 
Recruiting  Command  (NBC)  allocates  a  fraction  cf  the 
national  gucta  to  each  of  the  Navy  Recruiting  Areas  (NBA) . 
The  six  KBA's  in  turn  allocate  fractions  of  their  guota  to 
each  Navy  Recruiting  District  (NBD)  within  the  area. 
Bistcrical  enlistment  rates  and  differences  in  regional 
population  are  primary  determinants  of  guota  allocation, 
although  economic  aid  demographic  factors  are  also  taken 
into  consideration  £Eef •  2  ].  Information  obtained  from  a 
regional  study,  through  a  better  understanding  of  the  inter- 
acting processes,  maj  be  cf  more  practical  value  in  allo- 
cating recruiting  resources  for  maximum  benefit,  acd  in  mere 
eguitatl}  assigning  recruiting  goals. 

lie  recruiting  district  chosen  for  study  is  the 
Albuguergue,  New  Mexico  recruiting  district.  This  district 
was  chosen  at  random  from  the  44  recruiting  districts  nation 
wide.  This  district  encompasses  the  entire  state  of  New 
Mexico,  the  panhandles  of  Texas  and  Oklahoma  as  well  as 
seven  ccunties  in  scuth  western  Kansas.  In  general  this 
district  experienced  an  increasing  work  force,  increasing 
unemjzlcyaent  and  a  significant  variation   in  the   number  of 
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froducti.cn  recruiters  during  the  period  of  study.  Appendix  B 
contains  the  time  series  plcts  of  the  variables. 

Background  demographic  factors  for  the  area  that  are  cf 
interest  are  as  follows.  The  average  per  capita  income  for 
the  state  cf  New  Mexico  in  1979  was  $  6119,  43  rd  in  the 
naticr-  The  populaticn  in  1980  was  75%  White,  2%  Black  and 
8%  American  Indian.  37%  of  the  population  was  of  Spanish 
crigiE.  1351  of  the  populaticn  was  between  18  and  24  years 
eld  ££€±.  3]. 
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II.    A    BEVIEi   QF    1BE   LIIEfilTUfiE    ON    MI1ITABY    SUEFLI 

A.       EABIIBG    (1979) 

In  his  thesis  or  the  supply  of  first  term  enlistees  to 
the  Urited  States  Marine  Corps,  Darling,  [Ref.  4],  provides 
a  detailed  review  of  the  literature  on  the  supply  models 
developed  from  the  Gates  Commission  (November  1970)  tc  the 
publication  of  his  thesis  (flarch  1979)  .  The  following 
review  will  cover  tie  pertinent  literature  commencing  with 
Iarling*s  wcrk. 

Darling  used  t*c  distinct  analytical  technigues  to 
develop  models  in  cider  to  forecast  monthly  first  term 
enlistees  in  the  United  States  Marine  Corps.  A  multiple 
regressicn  model  was  derived  based  on  it's  compatibility 
with  his  theory  of  occupational  choice,  the  intuitive  appeal 
cf  certain  explanatory  variables,  the  past  literature  of 
manpower  supply  and  the  statistical  significance  of  each 
variables  impact  on  monthly  enlistments.  The  data  base 
utilized  was  a  time  series  of  monthly  enlistments  spanning 
the  period  from  July  1973  to  June  1978. 

For  his  multiple  regressicn  model,  Darling  chose  as  the 
dependent  variable;  monthly  observations  of  the  ratio  of 
male  ncnpricr  service  high  school  graduates  enlistirg  ir  the 
regular  Marine  Corps  to  male  high  school  graduates,  aged  16 
to  24,  net  enrolled  in  college  (S) .  Darling  chose  as  the 
independent  variables:  (1)  Civilian  -  Military  pay  ratio 
(CMPB),  (2)  a  deseascnalized  estimate  cf  the  national  unem- 
ploynent  rate  for  persons  aged  16  to  19  (UNEM) ,  (3)  the 
monthly  nunrer  of  qualified  leads  obtained  from  postcards 
which  were  included  in  the  national  printed  media  advertise- 
ments (LEADS),    (4)   a  dummy  variable   to  correct   for  the 
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abnormally  large  number  of  enlistments  in  the  month  of 
Eecemter  1976  to  tak€  advantage  of  the  expiring  GI  Bill  (EJ) 
and  (5)  the  number  cf  Marine  recruiters  on  duty  each  month 
(EFFEEC) .  His  initial  hypothesis  was  that  the  number  of 
monthly  enlistments  is  dependent  on  the  above  independent 
variables  in  the  functional  form  of  a  logistic  model: 


S=  fUeip  [-(B^^CMEB 


♦  BpEJfBEC  ♦B-OMM  *B.  LEADS  ♦Bj-DJ  ♦  a)JJ  -» 

Cnly  these  regular  enlistments  of  male  high  schcel  grad- 
uates in  the  top  two  mental  groups  were  considered  tc  ensure 
that  the  observed  lumber  of  menthly  enlistments  was  cot 
restricted  by  Marine  Corps  policy  (demand  driven)  and  was 
therefore  ar  accurate  representation  of  points  on  the  supply 
curve.  All  the  independent  variables  listed  above  (except 
EJ)  were  lagged  from  one  tc  six  months  and  regressiens  were 
run  tc  discover  those  variables  that  significantly  contrib- 
uted tc  the  pattern  of  enlistments  and  to  ascertain  the 
apprciimate  delay  ic  the  impact  on  contracts  signed  of  a 
change  in  the  value  of  one  of  the  variables.  The  final 
regression  equation  selected  was  based  on  the  intuitive 
appeal  cf  the  regression  eguation  and  the  statistical 
significance  of  the  results.  The  best  fit  equation  was 
determined  to  be: 

LM/SN  «  -2.669  -5.647  CMFR5  +1.00  UNEM  ♦  0.672  EJ  +a 
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where : 

S     ■  the  enlistment  rate 

CMEB5  ■  civilian  -  ailitary  cay  ratio 
lagged  five  months 

ONEM   =  unemployment  with  no  lag 

EJ    =  dummy  variable  for  Eecember,  1976 
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a  =  random   error. 

all  independent  variables  were  reported  to  be  signifi- 
cant at  the  10%  level  with  a  coefficient  of  determination, 
E-sguared  value,  of  .42.  Of  significance  is  tie  exclusion 
of  the  variable  representing  recruiters.  Darling  reported 
that  when  this  variable  was  forced  into  the  regression  equa- 
tion, its'  regression  coefficient  was  never  statistically 
different  from  zero.  Darling  also  reported  a  Dirbin-Watson 
statistic  of  1.0  8,  indicating  a  positive  serial  correla- 
tion among  the  residuals.  He  felt  that  this  was  probably  due 
to  the  lack  of  an  attempt  to  explain  the  seasonal  nature  of 
Marine  Corps  enlistments  to  this  point  in  his  study.  In  an 
attempt  to  correct  this  problem,  the  Box-Jenkins  methodology 
was   applied. 

Darling  utilized  the  data  from  the  multiple  regression 
model  in  a   Box-Jenkins  procedure   to   arrive   at   a   final   model: 

YT  =    .  804251YT.|  ♦YT.lz-.8042  51  YT_„-7.  095902   +  e.,-.  47  8727eT_,2 

Where  Y  was  the  number  of  male  high  school  graduates  in 
mental  groups  I  and  II  who  enlisted  in  month  t  and  e  was 
the  error  term.  Utilizing  this  model,  he  conducted  two 
forecasts  with  varying  degrees  of  success.  The  first  fore- 
cast conducted  incorporated  the  abnormally  higi  enlistment 
month  of  December  1976.  This  resulted  in  the  distortion  of 
the  forecasts.  For  the  second  forecast,  not  incorporating 
December  1976,  the  pattern  of  enlistments  was  well  repre- 
sented  by   the   model. 

Darling  concluded  his  thesis  by  combining  the  two 
methods  in  applying  Box-Jenkins  to  the  residuals  of  the 
multiple  regression  analysis  as  if  they  were  an  original 
time  series.  He  then  used  this  combined  model  to  forecast 
Marine  enlistments  with  a  greater  degree  of  accuracy  than 
with   either   model   individually. 
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E.   HIE  AHE  GILBOY  (1983) 

Eale  and  Gilroy  in  two  studies  of  the  economic  deterni- 
nants  cf  nilitary  enlistment  rates  [Ref.  5  and  6]  utilized  a 
regression  equation  to  forecast  enlistment  rates  cf  male 
high  schccl  graduates  in  mental  groups  I  through  III  upper 
for  the  Amy.  Ihe  dependent  variable  was  total  contract 
signers  divided  ry  tie  relevant  civilian  population  cf  16  to 
19  year  elds.  Their  basic  equations  were  estimated  with  a 
linear  functional  form  using  generalized  least  squares, 
correcting  for  the  presence  of  first  order  autocorrelation. 
Their  equation  for  all  males  was: 

A/P  =  -1S66.90  «•  16. 3S  OM  ♦  7.77  OM-2  ♦  4.59  OM-4 

♦  40.89    H4      *    313.07    BUI    ♦    408.40    VEAP 

♦  6.26    KICK    ♦    64.65    Q3 

♦  819.73    GI    ♦    148.38    TABGET   -    0.00    BECBTB 

where: 

A     =  Army  accessions  plus  OEP  of  male 
NES  high  school  grads  in  mental 
groups  I  through  III  upper. 

P     =  Eevelant  male  population  of 
16  to  19  year  olds. 

UM    =  Current  uneaployment  rate  of 
16  to  19  year  olds. 

UM-2   =  DM  lagged  two  months. 

UM-4   =  OH  lagged  four  months. 

H4    =  Ratio  cf  first  year  fcasic  military 
compensation  to  average 
weekly  earnings  in  the  private  sector, 
with  a  four  month  lead. 

GI     =  Dummy  variarle  =  1  in  December,  1976  when 
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Gl   Bill  expired,  and  =0  everywhere  else. 

Bill   ■  Maximum  monthly  benefit  for  a  GI  bill 

beneficiary  without  dependents,  deflated 
by  the  consumer  price  index.  Variable 
is  set  to  2ero  after  December,  1976. 

VEAf   *  Maximum  monthly  benefit  for  a  VEAP  teneficiarj 
without  dependents  ,  deflated  by  the  consuner 
price  index.  Variable  is  set  to  zero  before 
January,  1S77. 

Kick   ■  Maximum  value  of  kicker  payment  (offered 

enly  tc  Amy  enlistees  who  entered  critical 
specialties  in  combat  arms)  deflated  by 
the  consumer  price  index. 

TABCE1-  Einary  variable  =  1  from  November,  1979  to 
August  198  1  when  high  school  grads  were 
specifically  targeted,  and  0  elsewhere. 

Q3     ■  Seasonal  dummy  variable  =  1  in  July,  August 
and  September,  and  =  0  elsewhere. 

BECE1B-  Number  of  production  recruiters. 

As  with  Darling's  work,  of  some  interest  is  the  insig- 
nificant statistical  results  for  the  number  of  production 
recruiters.  Dale  and  Gilroy  do  not  conclude  that  recruiters 
are  unimportant,  bet  that  the  variation  in  the  number  of 
production  recruiters  has  not  been  large  enough  tc  shew  any 
correlation  with  enlistment  rates.  They  include  the  variable 
"TABGIT"  as  a  proxy  fcr  the  recruiters*  efforts. 

In  their  study  they  determined  that  a  rise  in  the  unem- 
ployment rate  of  16  to  19  year  old  males  led  to  a  signifi- 
cant increase  in  Army  enlistments  and  that  a  drop  in 
national  unemployment  from  ten  percent  to  nine  percent  would 
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cause  Arny  enlistments  of  male  nonprior  service  high  schcol 
graduates  to  fall  by  about  8.8  percent.  They  project  that 
the  elasticity  of  tctal  Army  enlistment  rates  with  respect 
to  relative  pay  is  ir  the  range  of  .9  to  1.7  percent  for 
mental  groups  I  to  III  upper.  They  concluded  that  a  rela- 
tive decrease  in  military  pay  cf  one  percent  in  relaticn  to 
civilian  fay  would  cause  enlistment  rates  to  fall  substan- 
tially. Also,  they  found  a  statistically  significant 
increase  in  the  rate  of  enlistments  during  the  period 
November  1S79  to  Augcst  198  1  when  the  Army  was  committed  to 
increasing  the  percentage  cf  high  school  graduates. 
Finally,  they  concluded  that  educational  benefits  are  very 
important  tc  many  high  school  graduates,  including  these  in 
the  highest  mental  grcups. 

C.   GCIEEEB6  AND  GBEEISTON  (1983) 

Gcldterg  and  Greenston  [fief.  7]  provided  a  progress 
report  en  on-gcing  time  series  cross-sectional  analysis 
research  into  the  determinants  of  high  quality  enlisted 
supply.  They  contended  that  utilizing  a  regional  data  base 
rather  than  a  naticnal  approach  permited  more  accurate 
modeling  of  demographic  factors  and  produced  better  esti- 
mates of  the  effects  of  changing  labor  market  conditions. 
They  asserted  that  by  utilizing  a  regional  approach,  a 
greater  capability  tc  optimize  the  allocation  of  gcals, 
recruiters  and  advertising  is  achieved. 

A  regression  analysis  was  used  to  estimate  the  effects 
cf  supply  factors  en  the  number  of  contracts  signed  by 
nonprior  service  male  high  school  graduates  for  each 
service.  Annual  data  for  the  time  period  1976T  to  1982, 
using  43  Navy  recruiting  districts  as  units  of  observation, 
vas  evaluated.  Seperate  regression  models  were  developed 
for  mental  groups  I  tc  III  upper  and  mental  group  III  lower. 
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Ihey  assumed  the  supply  of  HSDG  contracts  in  a  recruiting 
district  to  be  a  leg-linear  function  cf  supply  factors  in 
the  fcrm  of: 

Ln  B  =  1.51  ♦  -75  IBFAY  ♦  .05  RUNEM  ♦  .15  LAVG 
-  0-26  VEAF  ♦  0.43  1FOP  -  0.0089  BLK 
♦  .0023  URfAN  ♦  0.58  LNREC  ♦  error  term 

iilier  €: 

LBIAY   -   logarithm    cf  regular   military    compensation 
over   a   four   year   period,    divided   by  full 
time   eguivalent  earnings  of   civilian  youth. 
Military   and  civilian   earnings   are   discounted 
at  30    percent.   Civilian  earnings   are  aged 
with    data   cr   median    weekly   earnings  of    16 
tc   19    year   elds. 

RUNEM   =   Change  in    the   unemployment   rate   for  all 
civilians. 

LAVG      =  logarithm   cf  the    average  unemployment  rate 
fcr  all  civilians   in   FY    1976-82. 

VEAF      -   Dummy    variable  egual   to   zero  in   FY    1976T 
and  one   in    IY's   1977-82-   It   measures 
the  net  efiect  of    the   change   over  from 
the  GI   Bill  to  VEAF. 

LFCF      =   Logarithm   cf   a  districts    17   to   21    year   old 
male   population  in   thousands. 

BLK        ■  percent  of   a   districts    17    to   21    year   old 
male    population  that   is  black. 

UFCF      =   Fercent  of   a   district's   17    to   21    year    old 

male    population  that   resides   in   a   urban   area. 

LNRIC   =  logarithm    cf   Navy's   production   recruiters. 
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In  the  results  cf  their  findings  for  mental  groups  I 
through  III  upper,  they  concluded  that  relative  nilitary 
pay,  cyclical  unemplcyment,  total  population,  urban  mix  ard 
recruiters  increase  enlistments,  while  black  populaticc  and 
the  less  of  the  GI  Bill  cause  enlistments  to  decline.  They 
deternined  that  a  one  percent  increase  in  relative  nilitary 
pay  *ould  cause  an  increase  of  Navy  enlistments  cf  0.75 
percent  and  that  a  one  percent  increase  in  the  unemployment 
rate  would  increase  enlistments  by  between  4.3  and  5.1 
percent.  Although  they  indicated  that  their  results 
utili2ing  regional  unemployment  were  inconsistant,  it 
appealed  that  ir  general  a  rise  in  regional  unemplcyment 
caused  a  rise  in  enlistments  in  the  region. 

lhey  fcund  that  the  loss  of  the  GI  Bill  caused  a  decline 
in  the  supply  of  target  candidates  of  26  percent  to  the 
Navy.  Itey  additionally  determined  districts  having  greater 
population  tended  to  have  mere  recruiters  resulting  in  their 
separate  effects  beinc  difficult  to  measure.  Ihey  did  deter- 
mine that  a  doubling  cf  recruiters  and  population  results  in 
slightly  mere  than  a  doubling  of  enlistments,  and  that  a  cne 
percent  rise  in  the  urban  population  increased  enlistments 
by  one  tc  t*o  terths  cf  a  percent.  Overall,  they  concluded 
that  the  research  approach  using  annual  time  series,  cross- 
sectional  cata  was  fruitful  and  that  additional  research 
will  enhance  the  usefulness  of  this  model  for  forecasting 
and  policy  analysis. 


III.  AfELICATICH  CF  BOX-JEHKINS 

JL   ABAI1SIS  STEF  OBI 

Ihe  Box-Jenkins  methodology  was  applied  to  a  mcnthly 
time  series  of  nonprior  service  high  school  graduate  males 
signing  contracts  in  the  Alhuguergue,  New  Mexico  recruiting 
district  duiing  the  period  October  1978  to  September  1S83 
(60  data  points) .  Ihe  data  was  limited  to  those  contract 
signers  ir  mental  grcups  I  through  III  upper  and  represented 
net  ccntracts  signed,  that  is  new  contracts  signed  less 
Belayed  Entry  Program  (DEP)  attrites.  Figure  3.1  is  a  time 
series  plct  of  this  data.  This  plot  suggests  strong  season- 
ality in  the  number  cf  high  guality  male  enlistes  with  peak 
months  appearing  to  he  in  late  summer,  predominately  August, 
and  in  January.  An  "average"  year  was  constructed  by  aver- 
aging tie  individual  months  over  the  five  year  pericd. 
Eigure  3.2  is  the  time  series  plct  of  this  average  year.  Ihe 
average  for  each  month  is  denoted  by  the  letter  "A".  Also 
depicted  in  the  graph  is  the  average  of  the  contracts  signed 
in  the  first  three  years  of  the  data  series  denoted  ty  "1" 
and  tie  average  of  the  last  two  years  denoted  by  "2".  Ihe 
average  cf  all  the  years  ,"A",  tends  to  support  the  assump- 
tion cf  seasonality  suggested  in  the  time  seris  plot  cf  the 
data  set.  However,  in  the  plot  of  the  last  two  years  alone, 
the  suggestion  for  seasonality  is  not  as  clear  which  would 
tend  to  indicate  that  some  process  change  has  taken  place. 

Cf  seme  interest  is  the  apparent  sudden  increase  in  the 
mean  number  of  contract  signers  over  the  approximate  last 
third  cf  the  series.  Further  analysis  indicates  that  for 
the  time  series  as   a  whole   the  mean  number  of   ccntracts 
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Figure   3.1        Monthly   Contracts   Signed. 
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signed  per  month  was  49.07.  For  the  approximate  first  two 
thirds  of  the  series  the  mean  was  42.83  and  for  the  approxi- 
mate last  third  the  mean  was  62.53.  Because  of  this  unusual 
pattern,  two  additional  time  series  plots  were  constructed 
for  this  data.  The  first  was  a  time  series  pLot  for  the 
first  forty  one  data  points  (October  1978  to  February  1982) , 
figure  3.3,  and  the  second  was  for  the  last  nineteen  data 
points  (March  1982  to  September  1983),  figure  3.4.  Although 
no  obvious  trend  is  apparent  in  either  data  subset, 
both    figures  do    provide   some  indication   of    seasonality. 


Figure  3.2       Average  Recruiting   Tear. 
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lhe  next  step  in  developing  the  models  for  high  guality 
recruitment  was  to  clot  the  autocorrelations  for  the  data 
sets-  Figure  3.5  is  the  autcccrrelation  plot  developed  by 
the  Hinitab  general  purpose  statistical  computing  system  for 
the  pericd  October  1S78  to  September  1983-  An  examination  of 
these  autocorrelations  indicate  that  non-stationary  tehavior 
in  the  process  level  (a  trend)  is  evident  in  the  tie  data. 
Ihe  evidence  of  such  behavior  is  indicated  by  having  auto- 
correlations which  dc  not  die  out  rapidly  to  zero-  Although 
the  autocorrelations  for  this  data  indicates  a  general  trend 
in  the  data,  the  tine  series  plots  and  the  mean  difference 
in  the  two  data  subsets  tend  to  suggest  not  an  overall 
trend,  tut  a  sudden  and  sustained  overall  increase  in  the 
number  of  contracts  signed,  by  the  target  cohort-  Ibis  nay 
te  caused  by  soae  yet  to  be  determined  external  influence. 
An  evaluation  of  this  sudden  and  sustained  increase  in 
contract  signers  will  be  addressed  in  Chapter  Four. 

l«c  separate  courses  of  analysis  were  pursued.  First,  in 
the  event  that  the  cause  of  the  sudden  increase  in  the  mean 
could  net  be  identified  and  Quantified,  a  model  which  has 
adequate  to  describe  the  data  set  as  a  whole  was  sought. 
Second,  in  the  event  that  the  cause  could  be  identified, 
its*  affect  quantified  and  any  future  occurrance  predicted, 
a  single  model  which  satisfactorily  described  the  underlying 
data  patterns  of  each  subset  around  their  mean  was  sought. 
If  the  cause  of  this  sudden  change  could  be  determined  or 
controlled,  then  the  second  model  could  possibly  be  used  to 
more  acccrately  forecast  future  contract  signers. 
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figure  3.3   Monthly  Contracts  (Oct  78  -  Feb  82). 

E.   AUICCOBEELATIOHS  1HD  PABTIAI  AUTOCOBBELATIONS  EXAMINED 

1-   Cctcber  J978  to  September  1983 

As  evidenced  by  figure  3.5,  there  is  a  suggested 
trend  for  this  data  set  due  to  the  increase  in  the  mean 
number  of  contracts  signed.  As  the  Box-Jenkins  methodology 
requires  stationary  data  to  be  effective,  differencing1  Mas 
used.  Ihis  data  set  was  differenced  once  and  the  resulting 
autocorrelations  and  partial  autocorrelations,  as  repre- 
sented ir  figures  3.6  and  3.7,  were  examined. 


differencing  is  accomplished  by  subtracting  the  first 
value  from  the  second,  the  second  from  the  third,  the  third 
from  the  fourth,  etc.  Differencing  is  the  most  rapid  method 
cf  removing  a  trend  from  a  time  series  [fief.  81. 
Differencing  may  be  performed  once  or  more  in  order  to 
remove  the  trend,  however  differencing  once  is  usually 
adeguate. 
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Figure  3.4    BonthJy  Contracts  (flar  1982  tc  Sep  1983). 

Three  statistically  significant  data  points  are 
observed  in  figure  3.6:  the  first,  the  twelfth  and  the  thir- 
teenth. Overall,  this  plot  suggests  that  the  trend  has  been 
removed  and  that  only  horizontal,  stationary,  data  remains. 
Ihe  strorg  positive  autocorrelation  at  the  twelfth  period 
supports  the  seasonality  suggested  by  figures  3.1  and  3.2 

*•   October   197  6  to   February  J982   and   Maf£h  198  2   to 
September  .1963 

Figures  3.8  through  3-11  are  the  autocorrelations 
and  partial  autocorrelations  for  the  periods  October  1 S "78  to 
Pebruary  1962  and  March  1982  tc  September  1983  respectively. 
These  clots  suggest  that  both  subsections  of  the  data  set 
are  trcndless,  with  a  suggestion  of  seasonality  evident  only 
in  the  October  1978  tc  February  1982  data.    Identification 
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Figure  3.5    AU10COBBE1ATIONS. 
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Figure  3.6    AU1CCOBBEIATICNS  OF  DIFFEBEHCED  DATA. 
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Figure  3.8    AC1CCOBBEIA1ICMS  (OCT  78  -  FEB  82). 

and  estimation  of   tie  seasonal  nature  of   these  two  sutsets 
is  very  difficult  because  of  the  brevity  of  the  time  series. 
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1 
2 

3 

a 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 


0.045 
0.292 
0.012 
C.052 
0.127 
0.056 
0.335 
0.132 
0.222 
C.096 
0.067 
0.095 
C.081 
C.300 
0.153 


0.6 
t 


-0.4   -0.2 
» —  f  - 


0.0    0.2 
._-► 1_. 

XX 
XXXXXXXX 

X 
XX 

xxxx 

XX 

xxxxxxxx 
xxxx 

XXXXXXXX' 
XXX 
XXX 
XXX 
XXX 

xxxxxxxx 
xxxxx 

-2SE  +2SE 


0.4 
_  I 


c.e 

.1 


figure   3.10         AMCCOBBEIA1IONS    (HAS    82   -   SEP    83) 
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Figure  3.11    PARTIAL  AOTOCOBBEIATIOMS  (BAB  82  -  Sli    83). 

C.   HCIfl  DEYEIOEflEHl 

Ie  comparing  the  autocorrelations  and  partial  autocorre- 
lations to  the  various  AfiMA  models  in  figures  A. 5  through 
A. 13,  a  moving  average  model  is  suggested  for  the  data  set 
for  Cctcber  1978  to  September  1983  as  well  as  for  each 
subset  of  the  data.  However,  which  particular  MA  model  would 
test  describe  the  underlying  data  pattern  in  each  case  is 
unclear. 

1.   Cctcber  1978  to  September  1981 

laking  into  consideration  the  apparent  seasonality 
of  the  data  set,  the  apparent  trend  in  the  data  anc  the 
uncertainty  as  to  which  specific  HA  model  would  be  most 
appropriate  ,  two  models  were  developed  for  evaluation  tased 
en  a  visual  inspection  of  the  autocorrelations  and  partial 
autocorrelations. 


Z  A 


a.    first  Order,    Seasonal,    Moving   Average   Model   with    the 
fori: 


YT-YTi  =  ^YT=(1-e,B)  (1-e,tBi2)eT  (egn    3.1) 


r.   Second  Crdei,   Seasonal,   Moving  Average  Model  with 
the  form: 


Yt-yt-»=Ayt  =  (1-e,B-94B2)  (1-e,aB»2)eT       (egn  3.2) 

Ehere  e   is  the  erroi  term. 

*•   October  J978  tc  February  19  82 

In  developing  the  models  to  be  tested  for  this  data 
subset,  the  absence  of  an  apparent  trend,  the  suggested 
seasocalitv,  and  the  uncertainty  as  to  which  particular 
coving  average  model  would  best  describe  the  underlying  data 
patterns  were  taken  into  consideration.  Two  models  were 
selected  for  evaluation  based  on  a  visual  inspection  cf  the 
autoccrrelations  and  partial  autocorrelations. 

a.  first  Order,  Seasonal,  Moving  Average  Model  with  the 
fern: 

V  ■  (1-0.B)  (1-e,zBi2)er  (egn  3.3) 


r.   Second  Order,   Seasonal,   Moving  Average  Model  with 
the  form: 
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\    =  (1-e.B-e2E2)  (1-  e,xEl2)eT  (egn  3.4) 

Where  e  is  the  erroi  term. 

3.   March  J9  62  tc  Sept.ej.ber  J983 

Id  developing  the  models  to  be  tested  for  this  data 
subset,  the  absence  of  an  apparent  trend,  insufficient 
length  in  the  time  series  to  exhibit  seasonality,  and  the 
uncertainty  as  to  which  particular  moving  average  model 
would  test  describe  the  data  patterns  were  taken  into 
consideration.  Twc  models  were  selected  for  evaluation 
based  on  a  visual  inspection  of  the  autocorrelations  and 
partial  autocorrelations. 

a.  First  Order  Mcving  Average  Model  with  the  form: 


lr=(1-e,B)eT  (egn  3.5) 


t.    Second   Order    loving    Average   Model   with  the   form: 


Xt    -O-  e,E-  ep)eT  (egn    3.6) 


Rhere  e  is  the  error  term. 


C.   HCC£I  EvALUAIIOH 


The   models  developed   were   evaluated  utilizing   the 
Minitab  general  purpose  statistical  computing  system  command 
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"ARIfl-A"  (Autoregressive  Integrated  Moving  Average).  2  in 
crder  for  a  model  tc  be  considered  adequate  the  residuals 
(errors)  of  the  estimates  generated  by  the  model  would  be 
required  to  be  randca  (unccrrelated)  with  a  mean  of  zero. 
If  each  cf  the  models  developed  for  each  specific  data  set 
satisfied  this  requirement,  the  sum  of  squared  errors  and 
the  1-ratics  for  each  model  would  be  examined  to  determine 
the  mcst  appropriate  iiodel.  The  results  of  this  evaluation 
are  contained  in  Table  I. 

I.   BISDI1S  OF  THE  UCIEL  E¥AIUATIOH 

1.   Cctcber  J978  to  Septemrer  1983 

As  evidenced  by  table  I,  both  models  considered 
satisfied  the  reguireient  of  random  residuals.  There  exists 
little  difference  between  the  two  models  to  suggest  which  is 
the  icre  appropriate,  although  confidence  in  the  second 
model  is  reduced  by  the  low  T-ratio  for  the  MA  (2)  operatcr. 
In  this  case  the  prirciple  cf  parsimony3  was  invoked  and  the 
selected  model  was  the  MA(1)  with  the  form: 


YT  =  (1-.64UB)  (  H.448B*2)  eT  (egn  3.7) 


<-  101)    (.  U2)     std.  error 


2lhe  concepts  and  notations  used  by  this  com mand.  fellow 
the  ones  developed  by  Box  and  Jenkins.  The  values  of  <D  and  Q 
are  determined  utilizing  a  nonlinear  least  squares  algorithm 
developed  ty  Marguart.  The  ABIMA  command  fits  non-seasonal 
and  seescnal  models  to  a  time  series  with  input  consisting 
cf  a  stored  time  series  and  information  as  to  the  degree  or 
the  AE,  HA  or  ARMA  mcdel  to  be  fitted  £Bef.  9]. 

33he  principle  of  parsimony  cautions  to  attempt  tc  use 
the  snallest  possible  number  or  parameters  which  represent 
the  mcdel  adequately  £Bef.  10]. 
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2.      Cctcber   J978   to   February    1982 

lhe  two  models  selected  ior  this  data  subset  were 
run  hith  results  displayed  in  Table  I.  The  residuals  for 
roth  models  met  the  randomness  requirement,  however  the  low 
1-ratics  fcr  the  non-seasonal  coefficients  in  both  models 
has      less      than    encouraging.  As      a      result   of      these      low 

T-ratios,    another    model,    SMA(1),      was   attempted-      This    model 
has    the    form: 


YT    *   0  *(1-  6t^l2)*r  (egn   3.8) 


An  examination  of  Table  I  for  this  data  set  indi- 
cates little  to  distinguish  one  model  from  another  in  terms 
of  mere  adequately  describing  the  data  patterns.  Again  the 
rule  of  parsimony  was  used.  The  model  selected  was  the 
£MA(1)  mcdel  with  the  form; 


Y   =  42.46  *    (H.776Bi2)eT  (egn  3.9) 

(1.78)     (-2C2)     std.  error 

3.   fijich  19  82  tc  September  1983 

Here,  again,  each  model  selected  for  evaluation 
satisfied  the  requirement  fcr  random  residuals.  The  signifi- 
cant difference  between  these  two  models  is  in  the  residual 
sum  cf  squared  errors,  where  the  HA (2)  model  is  clearly 
supericr  to  the  MA(1)  model.  Also  encouraging  is  the  signif- 
icant I-ratio  exhibited  by  the  MA  (2)  coefficient  in  the 
second  mcdel.  lhe  mcdel  selected  for  forecasting  purposes 
has: 
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Y   =  62-S5  ♦  (1-.25CB-.745B2)eT  (egn  3.10) 

(0.23)     (.2C6)  (.208)  std.  error 

I.   FCBICAS1ING  WITH  IHE  SEIEC1ED  MODELS 

latle  II  and  figures  3.12  through  3.15  represent  the 
results  cf  the  forecasts  of  the  selected  models.  Ih€  actual 
values  fcr  the  net  nuafcer  of  target  cohort  contracts  signed 
and  tie  forecasted  values  are  displayed  vithin  each  tarle 
secticn.  The  95  percent  limits  displayed  indicate  that  cne 
should  he  certain  that  the  actual  value  will  fall  tetueen 
these  tuc  values  95  percent  of  the  time.  In  figures  3.12 
through  3.15,  the  forecast  results  are  displayed  graphi- 
cally. Ihe  actual  net  contracts  signed  are  dipicted  fcy  the 
letter  "A",  the  forecasted  values  by  the  letter  "F"  and  the 
upper  and  lewer  95  percent  confidence  limits  by  the  letters 
"U"  and  "1"  respectively. 

larle  II  is  divided  into  four  sections.  Ihe  first 
secticn,  corresponding  to  figure  3.12,  is  a  forecast  fcr  six 
periods  ahead  fcr  tie  period  October  1983  to  March  1S8U 
rased  on  the  October  1978  to  September  1983  time  series 
utilizing  the  seasonal  MA(1)D  model  previously  selected  for 
this  tine  series.  Kith  the  exception  of  the  March  1S84 
forecast,  all  actual  values  fall  within  the  95  percent 
confidence  limits,  houever,  these  confidence  limits  are  too 
wide  to  re  cf  much  practical  value.  As  evidenced  ty  figure 
3.12,  the  model  dees  generally  capture  the  pattern  of 
enlistments  for  the  forecasted  period,  although  the  degree 
of  the  decrease  for  March  1984  is  not  well  represented.  Ihe 
variation  in  the  forecasting  errors,  ranging  froi  a  lew  of 
♦5-0  percent  to  a  high  of  +67.4  percent  are  not  particularly 
encouraging. 
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In  the  second  section,  corresponding  to  figure  3.13,  the 
forecasts  are  conducted  on  a  month  to  month  basis  utilizing 
the  seguential*  forecasting  method.  For  example,  the  initial 
forecasts  fcr  this  data  set  were  conducted  utilizicg  60  data 
points.  When  the  6  1  st  data  point  became  availatle,  the 
parameters  cf  the  same  model  specification  were  reestimated 
using  61  data  points  and  a  forecast  for  period  62  was 
conducted.  Forecasts  were  conducted  in  this  manner  for  the 
pericd  October  1983  tc  March  1984.  As  evidenced  ty  section 
Two  of  the  table,  fcrecasting  with  this  method  produces 
superior  results  to  fcrecasting  multiple  periods  ahead.  For 
the  nest  part,  forecasted  values  well  represent  the  actual 
values,  although  here  again  the  model  fails  to  adequately 
represent  tie  sudden  decrease  for  March  1984. 

In  sections  three  and  fcur  of  Table  II,  corresponding  to 
figures  3.14  and  3.15  respectfully,  the  forecasts  cf  the 
MA(2)  mcdel  developed  for  the  March  1982  to  September  1S83 
time  series  are  examined.  Section  three,  as  in  secticr  cne 
cf  the  table,  is  a  forecast  for  six  periods  ahead.  Ihe 
obvicus  prcblem  of  attempting  to  forecast  future  periods 
with  a  rcn-dif f erenced  MA  model  is  the  models  tendency  to 
forecast  the  mean  cf  the  time  series  after  only  a  few 
perieds,  precluding  any  meaningful  evaluation  of  these  fore- 
casts. Observing  the  first  two  forecasts  for  the  period 
October  1983  to  March  1984,  the  results  are  not  very  encour- 
aging. Ihe  errcrs  fcr  these  two  periods  are  much  toe  large 
to  be  cf  any  practical  application. 

In  section  four  the  forecasts  were  conducted  with  the 
sequential  forecasting  method  utilizing  the  March  1982  to 
September  1S83  time  series.   With   only  one  forecasted  value 


♦As  soon  as  the  value  of  the  next  data  point  becomes 
available,  all  forecasts  for  future  periods  are  updated 
utilizing  reestimated  parameter  values  of  the  ABMA  model,  as 
opposed  tc  the  adaptive  method  in  which  the  ABMA  model 
parameters  remain  urcXanged. 
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falling  within  ten  percent  cf  the  actual  value,  and  with  the 
forecasts  being  condccted  only  cne  period  ahead,  the  nodel 
would  appear  to  have  extremely  limited  usefulness. 

Cf  tie  two  models  developed,  the  Seasonal  MA(1)E  model 
developed  for  the  October  1978  to  September  1983  time  series 
would  seem  to  be  the  most  useful  in  forecasting  future  net 
values  of  high  quality  recruits.  However,  as  evidenced  by 
Table  II  and  Figures  2.12  and  3.13,  sudden  major  swings  in 
the  values  cf  contracts  signed  are  not  adequately  predicted 
by  this  model.  In  Chapter  Four  a  causal  model  utilizing 
regression  analysis  will  be  examined  for  applicability,  and 
to  deternine  if  sudden  shifts  in  the  dependent  variable  will 
be  adeguately  described  by  a  causal  model. 
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TAfilE   I 
MO  I II    EVAIOAIION    RESULTS 


Qctotjr  JS78   to  September   JS83 

MA(  1)D 


MA(2)  D 


Besiduals: 
Mean: 
Std   Lev: 

Si  gallic a nee    Level 
for    Mean: 

Ban  dcm 
♦  0.  80 
8.  89 

2.32 

fiANDOM 
♦  1.03 
8.66 

2.26 

fiesicual 
Sum   of   Sguares: 

4619.61 

4416.89 

T-Bat. 
MAM] 
MA<2, 

SMA  (1 

Los: 

♦6.  39 

NA 

-2.  76 

♦  3.46 

♦  1.72 
-3.01 

Qctcher  JS78   to  Zeircar^  J.S82 

MA(1) 

Besiduals: 
Meai: 
Std  Dev: 

Significarce  level 
for  Mean: 

Besicual 

Sum   of   Sguares: 

T-Batios: 
MA  (1( 
MA    2 

sma  h; 


MA  (2) 


Constant: 


42.37 


42.43 


Actual    lean  for  times   series:    42.33 

March  JS82   to   Segteiier  19  83 

MA(  1) 


MA(2) 


Besiduals: 

Mean: 
Std    Eev: 

Ban  dom 
-0.  02 
8.  48 

Random 
♦  0.42 
7.  13 

Sigsif icacce  level 
for    Mean: 

3.  89 

3.27 

Besicual 
Sum   of   Sguares: 

1294.39 

918.04 

T-Batics: 
MA  (1) 
MA  \2) 

♦0.  54 
NA 

+  1.20 
♦  3.59 

Constant: 


62.  47 


62.95 


SMA  (1) 


Ban  dcm 
♦0.  76 
6-  27 

fiandom 
+  0.73 
6.35 

fiandem 
♦  0.75 
6.37 

1.  96 

1.98 

2.00 

1594.65 

1634.77 

1646.43 

-1.05 

NA 

-3.  52 

-0.91 
-0.03 
-3.73 

NA 

NA 

-3.83 

42.46 


Actual   neari  for   the    time  series:    62.53 
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TABLE    II 
FORECAST    RESULTS 

OCTCEEB    1S78    TC    SEPTEMBER     1963    DATA    SET    FORECASTS 

FORECASTING   SIX    PERIODS    AHEAE 

95    PERCENT    LIMITS 


MONTH 

FORECAST 

LOWER 

UPPER 

ACTUAL 

%    ERROR 

OCT    63 

65.7 

48.  1 

83.4 

53 

+  24.0 

NOV    63 

64.4 

45.  6 

83.  1 

52 

+  23.8 

DEC    S3 

65.8 

46.  0 

85.6 

59 

+  11.5 

JAN    64 

63.4 

42.  7 

84.  1 

59 

♦  C7.5 

FEE    64 

58.8 

37.  2 

80.5 

56 

+  05.0 

MAE    64 

56.9 

34.  3 

79.4 

34 

+  67.4 

SECOENTIAI 

FOfiECASTING    METHOD 

95    PERCENT   LIMITS 

MONTH 

FORECAST 

LOWEB 

UPPER 

ACTUAL 

%    ERROR 

OCT    83 

65.7 

48.  1 

83.4 

53 

♦  24.0 

NOV    83 

59.8 

41.  9 

77.6 

52 

+  15-0 

DEC    83 

58.5 

40.  8 

76.3 

59 

-CO. 8 

JAN    64 

56.6 

38.9 

74.2 

59 

-04.1 

FEB    64 

53.6 

36.  1 

71.1 

56 

-04. 3 

MAfi    64 

53.0 

35.  6 

70.4 

34 

+  55.9 

MARCH    1982   TO    SEPTIMEER    1983    DATA    SET    FORECASTS 


FOBICASIING    SIX    PERIODS    AHEAD 


95    PERCENT    LIMITS 


MONTH 

FORECAST 

LOWER 

UPPER 

ACTUAL 

*    ERROR 

OCT    63 

69.2 

54.3 

84.0 

53 

+  30.6 

NOV    83 

58.8 

43.  5 

74.  1 

52 

♦  13.1 

DEC    63 

63.0 

44.  1 

81.8 

59 

+  C6.8 

JAN    64 

63.0 

44.  1 

81.8 

59 

+  06.8 

FEE    64 

63.0 

44.  1 

81.8 

56 

+  12.5 

MAfi    64 

63.0 

44.  1 

81.8 

34 

+  85.3 

SECOENTIAI   FOfiECASTING    METHOD 


95    PERCENT    LIMITS 


MONTH 

FOBECASI 

LOWEE 

UPPER 

ACTUAL 

%    EfifiOR 

OCT    63 

69.2 

54.3 

84.0 

53 

♦  30.6 

NOV    83 

61.3 

44.  2 

78.5 

52 

+  17.9 

DEC    S3 

67.5 

50.  5 

84.5 

59 

+  14.4 

JAN    64 

63.1 

46.  2 

79.9 

59 

+  C6.9 

FEE    84 

62.6 

46.  1 

79.1 

56 

+  11.8 

MAfi    84 

61.5 

45.  2 

77.8 

34 

+  60.8 
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Figure  3.12    MA(1)  SIX  MONTH  FORECAST. 
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Figure    3.13        HA(1)    SEQUENTIAL    FOBECASTS 
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Figure  3.14    MA(2)  SIX  MONTH  FOBECAST. 
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IV.  IHE  H EGRESSION  MODEL 

1.   E1SIC  A£SUHPTION£ 

It  vas  assumed  that  the  forecasts  generated  with  the 
ABIMA  model  presented  in  Chapter  3  could  be  improved  ufcn  by 
developing  a  model  fchich  took  into  consideration  causal 
relationships.  The  factors  assumed  to  most  actively  icflu- 
ence  tie  net  number  cf  high  quality  nonprior  service  males 
opting  tc  jcin  the  Kavy  was  conjectured  to  be:  number  of 
production  recruiters,  seme  measure  of  unemployment,  size  of 
the  work  force,  military/civilian  pay  ratio  and  recruiting 
goals. 

The  utilization  cf  the  regression  model  requires  that 
four  hasic  assumpticrs  be  made.  First,  that  the  dependent 
variahle,  ret  contracts  signed,  is  linearly  related  tc  each 
of  the  independent  variables.  If  the  relationships  are  not 
linear,  then  regression  analysis  cannot  accurately  be 
applied  to  the  probleo.  Second,  the  variance  of  the  resi- 
duals (eirors)  cf  tie  regression  equation  remains  ccrstant 
ever  tie  tine  series.  Ihird,  that  the  residuals  are  indepen- 
dent cf  cne  another,  that  is  any  given  residual  shculd  not 
he  a  function  of  the  error  terms  preceeding  or  follcwicg  it 
in  the  time  series.  The  existence  of  such  a  relationship, 
called  serial  (auto)  correlation,  implies  that  either  an 
important  independent  variable  has  been  omitted  or  the  vrcng 
functicnal  form  has  teen  used  in  the  regression  eguaticn. 
Generally  fcr  a  data  set  of  the  size  used  in  this  study,  a 
Eurbir-Batscn  statistic  value  tetween  1.5  and  2.5  implies  a 
lack  cf  serial  correlation.  Ihe  final  tasic  assumption  is 
that  the  residuals  are  normally  distributed. 
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E,   VJBIJEIES  CONSIDEBED  FOB  IJiCIOSION 

1  •   K€ t  Con  tract s  Signed :  CNIBCTS 

As  previously  defined,  the  dependent  variable  is  the 
total  number  of  cortracts  signed  per  month  by  ncnprior 
service  high  school  graduate  males  in  mental  groups  I 
through  III  upper  within  the  Albuguergue,  New  Mexico 
Eecruiting  Eistrict  less  the  number  of  Eelayed  Entry  Prcgram 
(DEB)  attrites  of  the  same  cchort  during  the  same  menth. 
This  data  was  obtained  from  Navy  Recruiting  Command  for  the 
period  October  1S78  tc  September  1983. 

2-   The  Number  of  Production  Becruiters;  RCTRS 

The  values  fcr  this  variable  are  the  number  of  Navy 
Becruiters  who  were  actually  on  duty  in  Navy  Recruiting 
Eistrict  Albuguergue  during  the  period  of  the  time  series. 
They  represent  actual  canvassers  and  exclude  staff 
personnel.  This  data  was  cttained  from  Navy  Recruiting 
Command.  It  would  seem  to  be  logical  to  expect  an  increase 
in  the  runter  of  picduction  recruiters  to  result  in  an 
increase  in  the  number  of  contracts  signed,  subject  to 
effects  cf  marginal  productivity . 

3.   Numbers  of  Oremployed;  UNEMPID 

The  values  fcr  this  variable  represent  the  nunbers 
cf  people  determined  to  be  unemployed  by  the  0.  S. 
Eepartment  of  Labor.  The  values  of  this  variable  were 
divided  by  10,000  so  that  the  data  could  be  manipulated  mere 
easily.  These  values  were  divided  by  10,000  so  that  the 
data  could  be  manipulated  more  easily.  These  values  were 
built  up  from  a  county  level  and  represent  the  seasonally 
adjusted  number  of  uremployed  within  the  target  recruiting 
district  as  a  whole.  No  breakdown  by  sex  or  age  group  was 
available.   This  data  was  obtained  from  Navy   Recruiting 
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Commard.  It  was  assumed  that  an  increase  or  decrease  in  the 
numbers  of  uneiplcjed  would  result  in  a  correspcnding 
increase  or  decrease  in  the  number  of  contracts  signed, 

**-   Numbers  in  tie  Work  force;  WKFRCE 

Ihe  values  icr  this  variable  represent  the  numbers 
in  the  wcrk  force,  enployed  and  unemployed,  as  determined  by 
the  U.S.  Eepartment  cf  Labor.  The  values  of  this  variatle 
were  built  up  frcm  a  county  level  and  represent  the  seascn- 
ally  adjested  work  force  in  the  target  recruiting  district 
in  tctal.  This  data  was  ottained  from  Navy  Recruiting 
Commard.  As  the  values  for  this  variable  increased  or 
decreased,  it  was  expected  that  the  numbers  of  contracts 
signed  wculd  increase  or  decrease  correspondingly. 

5-   Military/Civilian  Pay  Ratio;  PAYRATIO 

The  merits  cf  any  constructed  military  /  civilian 
pay  ratio  could  be  argued  indefinitely  with  no  concrete 
results  as  to  a  most  meaningful  measure.  For  the  purposes 
of  this  study,  the  military  component  of  the  ratio  was 
calculated  as  monthly  base  pay  for  an  E-1  with  mere  than 
three  menths  service.  The  civilian  component  utilized  was 
average  weekly  earnings  of  production  workers  in  the  state 
cf  New  Mexico  multiplied  by  four.  The  average  weekly  earn- 
ings figure  was  obtained  frcm  "Employment  and  Earnings" 
published  b}  the  U.S.  Department  of  Labor;  Eureau  cf  laror 
Statistics.  Intuitively,  an  increase  in  the  military/ 
civilian  pay  ratio  wculd  result  in  an  increase  in  ccntracts 
signed. 

6  •   Percent  One alloyed:  PEECUNE 

The  values  of  this  variatle  are  UNEMPLD/WKFRCE.  This 
variatle  represents  the  percentage  unemployed  in  the  target 
recruiting  district.  This  variable  was  included  in  the  study 
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to  detemine  if  the  percentage  of  unemployed  within  the  wcrk 
force  acre  accurately  predicted  the  number  of  contracts 
signed  than  the  crdiral  value  cf  unemployed. 

7-  Change  in  Oneacloyjnent;  DELTA  UN 

Ihe  Values  for  this  variable  represent  the  month  to  month 
changes  (+/-)  of  the  variable  PEBCUNE  for  the  target 
recruiting  district.  This  variable  was  included  in  the 
study  to  determine  if  changes  in  the  unemployment  rate  could 
signal  corresponding  changes  in  the  number  of  contracts 
signed. 

8-  Cneaployment   fiatio;    UNiMRAX 

Ihe  values  of  this  variable  represent  a  ratio  of  the  change 
in  unemployment  frcn  month  to  month  within  the  target 
recruiting  district  as  expressed  by  the  following  formula: 

UNEiflD  in  month  T  -  UNEMPLD  in  month  1-1 
DKIMEA1  = 

UNEHPLD  in  month  T-1 

It  was  hcped  that  this  variable  could  capture  the  effects  of 
roth  tie  percentage  unemployed  and  the  change  in  unemploy- 
ment en  a  menthly  basis. 

9  •  £2 § !§.  Z   £uotas 

Dnf crtunately,  there  was  insufficient  data  en  goals 
for  the  target  cohort  to  examine  goals  for  inclusion  in  the 
regression  equation.  Intuitively,  goaling  would  seem  to  play 
a  key  rele  in  determining  the  number  of  contracts  signed.  A 
recruiting  district  which  had  achieved  target  guotas  for  the 
month  wocld  be  tempted  to  delay  additional  accessions  until 
the  fcllcwing  month  to  help  insure  the  attainment  cf  goals 
en  a  continuing  basis.  Additionally,  goals  would  seem  to 
mask  the  supply  function  of  recruiting  making   the  study  of 
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the  supply  problem  much  mere  difficult.5  It  is  the 
hypothesis  of  this  author  that  a  detailed  study  cf  the 
effects  cf  goals  on  enlistment  contracts  signed  would  indi- 
cate a  very  high  pesitive  relationship  bewteen  these  two 
variables.  Ihis  is  ar  area  which  demands  increased  attention 
and  studj. 

C.   TEE  EEGEESSICM  BCEEL  DEVEICfED 

In  determining  the  final  choice  for  the  explanatory 
variarles,  it  was  recessary  to  determine  the  lag  or  lead 
effect,  if  any,  that  was  exerted  on  the  dependent  variable 
ty  each  independent  variable.  For  example,  an  increase  in 
the  number  of  production  recruiters  may  not  affect  the 
number  of  contracts  signed  for  several  months,  assuming  that 
there  is  a  learning  curve  associated  with  the  pesiticn. 
Similarly,  an  increase  in  the  unemployment  rate  may  not 
affect  enlistments  fcr  several  months.  All  of  the  indepen- 
dent variables  previously  described  were  given  a  lag  and 
lead  cf  ficm  one  to  six  months  and  simple  regressions  were 
conducted  tc  determine  which  increment  of  lag  or  lead  best 
explained  the  variation  in  the  number  of  contracts  signed.  A 
variable  was  allowed  to  enter  the  equation  only  once.  That 
is  if,  fcr  example,  PERCOME  evidenced  a  significant  influ- 
ence en  CNIECTS  laggec  one  month  and  lagged  two  months,  only 
the  lag  with  the  most  significant  coefficient  of  determina- 
tion, E-sguared,  was  included  in  the  regression  eguation.6 


5£iegel  and  Eorack  [fief.  2]  provide  an  excellent  dicus- 
sion  cf  the  interaction  between  goals  and  the  recruiting 
effort. 

61he  decision  to  utilize  a  single  lag  or  lead  for  each 
variable  was  made  prior  to  conducting  the  initial  regres- 
sion. Hcwever,  during  the  the  analysis,  all  variable  lags 
and  leads  with  a  T-ratio  greater  than  2.00  for  a  simple 
regressicn  were  fitted  into  a  nultiple  regression.  The  coef- 
ficient cf  determination  of  this  multiple  regression  was  not 
significantly  different  frcm  the  coefficient  of  determina- 
tion for  the  multiple  regression  utilizing  a  single  test  fit 
lag  ex  lead  for  each  variable. 

50 


A  multiple  regression  model  utilizing  the  lag  or  lead  of 
each  variatle  with  the  highest  coefficient  of  determination 
was  constructed,  and  a  regression  conducted  on  the  dependent 
variatle  net  contracts   signed   with  results   as  follows: 

Y=    -42.6    ♦    .405    SC3ES1    ♦    6.0    UNEMPLD1    -    1.0    HKFECE1 

♦  256. C  PAYRAT04   -    241.0    EERCUNE2    ♦    28.2   DELTAUN 

♦  16.8    0NEMRAT2    ♦   a  (eqn    4.1) 

where: 

RC3BS1        =  numter    cf   production   recruiters   lagged    1   mcnth 

UHEtEiri    =  unemployment  lagged    1   month 

WKIBCE1      =  numcers   in   the    wcrk   force  lagged   1    month 

PAYBAIC4   =  military  /   civilian   payratio   lagged    4    months 

PEECUNE2   =  percent   unemployed   lagged  2   months 

DEI1A0N      =  change    in   unemployment  with   no  lag 

UNEHEA32    =  unemployment  ratio  lagged   2   months 

a  =  residual 


ST.  DEV. 

I- 

•RATIO* 

COICMN 

COEFFICIENT 

OF  COEF. 

COEF/S.E. 

COKSTAKT 

-42.6 

158.9 

-0.27 

RC1BS1 

0.4051 

0.2668 

1.52 

0NEBPLE1 

5.S7 

17.81 

0.33 

WKFBCE1 

-0.51 

1.187 

-0.43 

PAYBilC4 

256.17 

96.  18 

2.66 

PEECUNE2 

-241 

2498 

-0.  10 

DEI1AUN 

28.23 

10.26 

2.75 

UNECEA12 

16.81 

11.52 

1.46 

THE  S3.  £EV.  OF  Y  AECCI  THE  REGRESSION  LINE  IS 
£=  8.030 

E-SCOAREr  -  62.2  PERCENT 

fi-SCOJEEE  =  56.7  PERCENT,  AEJ0STED  FOR  D.F. 
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ANALYSIS    CF    VARIANCE 


DUE    10 

DF 

SS 

REGRESSION 

7 

5097.30 

EESIEGAI 

48 

3094.  8<J 

3CTAI 

55 

6  192.14 

as=ss/DF 

728.19 

64.48 


Those  variatles  kith  negative  T-ratios  were  excluded 
from  consideration  in  the  final  regression  model  as  having 
counter  intuitive  results.  The  regression  was  conducted  a 
second  tine  utilizing  the  variables:  RCTRS1,  UNEMELE1, 
FAYRA304,  LllThUK,  arc  UNEMRAI2.  The  results  of  this  secend 
regression  were  exaained  and  those  variables  having  a 
1-ratio7  less  than  2.00  were  eliminated  from  consideration 
in  the  fiial  model.  A  third  multiple  regression  was  then 
conducted    with    the    following   results: 

Y=-61.e*U.0NEMPLDU19c.PAYRATC4*19.8DELTAUN«-a    (eqn    4.2) 

hhere: 

0NEEELI1    =  uneiplcyaent  lagged    1    month 

PAYEA1C4   =  military/civilian   payratio   lagged  4    months 

DEI1AUK      ■  change    in   unemployment  with   no   lag 

a  =  residual 

ST.    DEV.  1-RA1IC    = 

CCI0MN  COEIFICIENT  OF    COEF.  COEI/S.D. 

CCKS3ANT  -61.84  35.11  -1.76 

UNEMELD1  3.670  0.545  6.61 

fAYRAT04  193.26  90.35  2.14 

EE11A0N  19.785  9.410  2.10 

3HE    S3.    LEV.   OF    Y    ABCC1    REGRESSION    LINE    IS 
S   -    8.112 


7A  high  T-ratio  indicates  that  the  independent  variatle 
is  imjertant  in  explaining  the  value  of  Y.  The  higher  the 
3-ratic,  the  more  unlikely  that  the  b  coefficient  is  a 
randem   variation  froi  zero. 
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B-SCDABEE  =  58.2  PEBCIN1 

£-S£OJB£I  '    55.8  PEECINT,  AIJOSTED  FOB  D.F 

ANALYSIS  OF  VABIANCE 


DUE    1C 

DF 

SS 

MS=SS/DF 

BEGBESSICN 

3 

4770.27 

1590. C9 

BESIICA1 

52 

4*121.88 

65.81 

TOTAL 

55 

8192.15 

The  signs  of  th€  coefficients  of  the  explanatory  vari- 
ables ai€  what  one  vculd  expect  intuitively.  As  unemp lcyment 
increases,  so  does  the  number  of  contracts  signed.  An 
increase  in  the  military/civilian  pay  ratio  will  cause  an 
increase  in  the  number  of  contracts  signed,  and  as  the 
percentage  unemployment  changes,  the  numbers  of  contracts 
signed  change  accordingly. 

One  cf  the  mere  surprising  results  of  the  regression  was 
the  exclusion  of  the  variable  BCIBS,  representing  the  number 
cf  prcducticn  recruiters.  When  forced  into  the  regression 
equation,  the  coefficient  for  this  variable  was  never 
significantly  differed  from  zeio.  Experience  and  intuition 
«ould  indicate  that  recruiters  play  an  important  role  in  the 
numbers  cf  high  guality  nonprior  service  males  enlisting  in 
the  Havy. 

There  may  be  several  explanations  for  the  results  cf  the 
regression  in  regards  to  recruiters.  In  examining  the  time 
series  clots  for  CNTECTS  and  BCTBS  contained  in  Appendix  B, 
it  is  evident  that  ever  the  approximate  last  half  cf  the 
period  the  numbers  cf  contracts  signed  is  increasing  as  the 
cumber  cf  producticn  recruiters  is  decreasing.  It  is 
possible  that  as  the  Navy  gains  more  experience  in  the 
selection  cf  recruiters,  better  recruiters  are  placed  with 
the  recruiting  districts,  resulting  in  higher  individual 
recruiter  production.   Also,  it  is  possible  that  the  various 
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services  compete  with  each  ether  for  the  target  cchcrt  with 
approximately  equal  determination  thus  canceling  the 
apparent  overall  influence  of  the  recruiters.  It  is  also 
possible  that  the  target  cohort  is  not  as  easily  influenced 
by  recruiters  as  "lewer  quality"  recruits,  as  the  target 
cohort  fcculd  seem  tc  have  a  greater  selecticn  of  civilian 
cccupaticns.  The  individuals  in  the  target  cohort  may  have 
already  made  the  decision  to  enlist  based  on  ecciciic 
factors  frier  to  contact  with  a  production  recruiter.  And, 
it  is  ccssifcle  that  i€cruiters  really  don't  have  a  signifi- 
cant impact  when  compared  to  the  impact  of  unemployment  and 
compensation. 

The  studies  discussed  in  Chapter  Two  found  little 
significance  between  the  numbers  of  production  recruiters 
and  the  number  of  enlistments.  Perhaps  the  services  in 
general  and  the  Navy  in  particular  have  passed  the  point  of 
decreasing  returns  as  far  as  recruiters  are  concerned,  lhere 
may  be  seme  optimal  level  for  the  number  of  production 
recruiters  such  that  additional  production  recruiters  ever 
this  level  are  not  beneficial  in  enlisting  additional 
members  cf  the  target  cohort.  It  may  be  more  cost  effective 
to  invest  in  advertising  than  in  additional  recruiters.  The 
area  cf  costs  and  benefits  of  recruiters  versus  advertising 
is  an  area  uhich  deserves  additional  study. 

The  exclusion  of  the  variables  WBKFRCE,  PEBCUNE  and 
ONEMBAT  is  not  as  discouraging.  As  evidenced  in  the  regres- 
sion mcdels  presented,  when  these  variables  where  included, 
there  was  an  adverse  impact  on  the  T-ratio  for  the  variable 
□NEHflDI.  Since  the  values  of  the  variable  UNEMPLD  are 
included  in  the  variable  WBKfBCE,  and  the  values  cf  the 
variables  EEECONE  and  ONEMBAT  are  constructed  from  the  vari- 
able CNEHPLE,  multicclinearity  among  these  variables  may  be 
a  factor,  and  is  suggested  by  the  regression  results.  Ihe 
effects   of   unemployment   and  the  change   in   unemployment 
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appear   well  represerted  in  the   final  regression   eguation 
hithcut  including  these  three  additional  variables. 

E.   BlSllUklS    EXAHIHIT. 

Ite  Cur tin- Wat son  statistic  for  the  regression  was  1.52. 
As  previously  stated,  a  Durbin-Watson  statistic  between  1.5 
and  2.5  normally  indicates  a  lack  of  serial  (auto)  correla- 
tion fcr  a  data  set  cf  the  size  used  in  this  study.  With 
three  independent  variables  and  a  sample  size  of  60,  a 
Eurbir- Watson  statistic  between  1.69  and  2.31  would  assure 
no  serial  correlaticc  in  the  residuals.  The  cbtained 
statistic  cf  1.52  falls  in  the  inconclusive  range.  Since 
this  test  is  inconclusive,  additional  study  is  varrented. 

The  standardized  residuals  and  the  raw  residuals  were 
clotted  in  figures  4.1  and  4.2  The  value  of  a  standardized 
residual  was  determined  by  dividing  the  residual  by  its 
estimated  standard  deviation.  Of  particular  interest  here 
is  ary  value  outside  ♦/""  2.00.  In  Figure  4.1  there  are  only 
two  values  that  lie  cutside  the  specified  limits.  Since  the 
cverwreliing  number  of  residuals  lie  well  within  these 
limits,  nc  significant  bias  is  indicated.  In  Figure  4.2,  the 
residuals  are  plotted.  The  residual  mean  is  -0.00058120 
with  a  standard  deviation  of  7.89.  Examing  Figure  4.2,  there 
is  nc  ctviots  suggestion  of  a  pattern  in  the  data.  Bowever, 
the  Eurfcin-  Watson  statistic  of  the  firal  multiple  regres- 
sion mcdel  reguires  that  further  analysis  be  conducted  to 
detenine  if  serial  correlation  exists.  In  the  following 
chapter,  the  residuals  of  the  regression  will  be  examined 
further.  Combining  the  regression  model  developed  in  this 
chapter  with  an  ARIHA  process  as  described  in  chapter  3 
would  provide  an  improved  model,  assuming  serial  correlation 
in  the  residuals.  If  no  serial  correlation  exists,  then  the 
regression  oodel  aloce  would  prove  to  be  the  superior  mcdel. 
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V.  1HE  COMBINED  MO  DEI 

A.   CCBBI1EI  MODEL  Di1£IOPEC 

The  nultiple  reciessior  irodel  developed  in  Chapter  4 
satisfied  all  requirements  except  for  the  possibility  of 
serial  correlation  ir  the  residuals.  In  order  to  determine 
if  in  fact  a  serial  correlation  problem  existed  and,  if  so, 
to  attempt  to  correct  this  prctlem,  the  residuals  were  exam- 
ined further.  The  hypothesis  vas  that  if  an  ABIMA  model 
could  re  developed  fcr  the  residuals,  then  a  serial  correla- 
tion riorlem  existed.  However,  this  same  model  could  in  turn 
he  applied  as  a  correction  tc  the  regressicn  model  for  more 
accurate  forecasts. 
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The  nultiple  regression  model  residuals  are  displayed  in 
figure  4.2  The  autcccirelations  are  plotted  in  Figure  5.  1  . 
In  this  figure  only  the  autocorrelation  at  lag  13  falls 
outside  ♦/*"  2  standard  errors.  However,  no  physical  inter- 
pretation such  as  seasonality,  can  be  attributed  to  this 
spike,  and  it  can  be  interperted  as  not  being  significant. 
Since  there  is  no  clear  indication  of  a  nonstaticnary 
pattern  in  the  data,  a  white  noise  model  was  indicated  as 
nost  appropriate.  This  model  has  the  form: 


ZT  =  aT  (egn  5.1) 


where: 

2r  =  the  residual  value  at  time  t 
aT  =  a  random  shock  at  time  t 

This  indicated  that  a  model  could  not  be  developed  for 
the  residuals  and,  when  the  attempt  was  made  to  fit  various 
ABMA  models  to  the  data,  no  ABIMA  model  could  be  found  that 
was  apprcpriate  for  the  non differenced  residual  time  series. 
This  strcngly  suggests  that  no  serial  correlation  existed 
for  the  residuals  of  the  regression  model  developed  in  the 
previcus  chapter. 

Eeexanining  figure  5.1,  there  is  a  slight  indicaticn  of 
a  trend  in  the  data  if  the  general  shape  of  the  plot  between 
lags  1  and  13  are  observed.  In  order  to  explore  this 
further,  the  residual  time  series  was  differenced  once,  with 
the  resulting  autocorrelations  plotted  in  Figure  5.2 
Examining  this  figure,  the  isaediate  impression  is  that  the 
data  has  been  over  differenced.  This  is  suggested  fcy  the 
large  necative  spike  at  lag  1.  Differencing  a  stationary 
series  produces  another  staticnary  series.    However,   the 
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model  gererated  ty  tie  over  differenced  series  will  be  mere 
complicated  than  a  mcdel  generated  by  a  stationary  series 
obtained  with  the  minimum  amount  of  differencing  [Eef.  10]. 
At  this  point  it  was  decided  tc  continue  with  the  attempt  to 
construct  a  combined  model,  even  though  it  was  realized  ttat 
the  possibility  was  small  cf  constructing  a  combined  model 
which  would  prove  superior  to  the  regression  model. 
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Figure  5.2    DIFfffiEBCEC  BESID0AL  AUTOCORRELATIONS. 

lie  partial  autoccrrelaticns  of  the  differenced  residual 
time  series  are  displayed  in  Figure  5.3  .  Based  on  a  visual 
inspection  cf  the  autocorrelations  and  partial  autocorrela- 
tions a  second  order  autoregressive  (AB2)  model  was 
initially  selected  for  testirg.  However,  the  AB2  model 
proved  net  to  be  appropriate.  Since  any  moving  average  (HAg) 
process  can  be  rewritten  tc  form  an  AB  process,  the  next 
step  was  tc  test  a  MA2  model  on  the  differenced  time  series. 
The  tested  model  had  the  form: 
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;r-2r.  =AzT  =  n-  a,  b-  0tE2)€r 


(egn    5-2) 


fihere    ZT     is  the  residual  value   from    the    multiple    regression 
model   at   tine  T    and   €T  is      the   error   term   for   the   MA2    model. 
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figure   5.3         PAB1IAI    AOTOCOHBELATICNS. 


TAEIE  III 

HES1I0AL  HODEI  EVALUATION 

Residuals  of  model:  Bandom 
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Tarle  III  suggests  that  the  model  is  appropriate  for  the 
residual  time  series.  The  ARMA  model  for  estimating  future 
errors  cf  the  regression  model  then  becomes: 
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^Z^.     =     (1   -.^72  B    -.306    B2    j  er  (egn    5.3) 

lihere  ^ZT*  is  the  estimated  error  of  the  multiple  regres- 
sion   icdel   at   tine   t   and  z'r    - /^Z  T     *   Zr.i 

where : 

ZT*    =     predicted    value   of    the   residual   at   time   t 

l^Lf    =     change    in   tie   residual    value   from   time   t-1 

Zf-i     =      value    of   the  residual  at   time   t-1 

letting  YT  equal  the  predicted  value  of  the  iiulticle 
regressicn  aodel  alore,  the  predicted  value  of  the  comtined 
model,    fiT   ,    then  beccses: 
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HT    =   YT  ♦   ZT  (egn   5.4) 

However,   in  examining   eguation   5.3   and  taking   into 

consideration  the  standard  deviation  for   the  values   cf  0, 

and  Qz  ,        it  is  conceivable   that   eguation  5.3   could  be 
factored  as: 


^Z^=(1-E)  (1*0.  35B)  eT  (egn  5.5) 


Cancelling  cut  the  differenced  values,  the  eguation  reccmes: 


ZT  =  (U0.35B)  eT  (egn  5.6) 
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Equation  5.6  indicat€S  that  th€  time  series  was  over  differ- 
enced and  suggests  ising  a  MA  1  model  on  the  nondifferenced 
residual  time  series-  However,  this  had  previously  teen 
attempted  was  found  ret  to  be  appropriate.  To  this  pcirt  in 
the  analysis  o£  the  residuals  of  the  regression  model,  all 
tests  ccrducted  continue  to  indicate  that  the  residuals  are 
randon:  and  that  no  serial  correlation  exists.  The  only  test 
remaining  was  tc  ccnbine  the  regression  model  previously 
constructed  with  the  MA2  model  constructed  anove  and  to 
compare  the  results  cf  each  models  forecasts. 

£•   TEE  fOfiECAST  COflfABISOHS 

Three  acdels  have  been  developed  in  order  to  forecast 
net  new  contracts  signed.  The  best  test  of  each  model  would 
have  teen  tc  forecast  some  month  or  series  of  months  subse- 
quent tc  September  1983,  the  last  month  of  the  time  series, 
and  tc  compare  the  results.  Due  to  the  nonavailability  of 
appropriate  data  this  was  not  possible.  As  an  alternative, 
the  mcdels  were  used  to  forecast  the  last  five  months  of  the 
time  series,  May  1983  to  September  1983,  since  the  results 
were  known  and  the  data  required  for  forecasting  with  both 
the  multiple  regression  and  the  combired  model  was  avail- 
able. The  sequential  forecasting  method  was  utilized  for 
all  three  models  with  results  as  contained  in  Table  IV 

As  measured  by  the  root  mean  squared  error,  the  regres- 
sion model  is  superior  to  the  ARIMA  model  and  slightly 
superior  tc  the  combined  model.  If  the  residuals  of  the 
multiple  regression  mcdel  were  serially  correlated,  then  the 
combined  model  should  have  corrected  for  this  problem  and 
produced  forecasts  superior  to  the  multiple  regression 
model.  The  forecast  results  serve  to  confirm  that  the 
multiple  regression  residuals  were  not  serially  correlated. 
The  superiority  cf  the  regression  model  comes  primarily  from 
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TABLE    I? 
ECBECAS1    COMPARISONS 


MONTB 
MAY    63 


JJN 

JUL 
AUG 

SEP 


82 
63 
63 
63 


ACIOAL 
59 
60 
56 

75 


ABIMA 
60.1 
62. S 
78. C 
61.8 
53. C 


♦01 

♦0  4 

♦  39 

♦  26 

-29 


85? 

a* 

3% 
1* 
3X 


SOU    CF    SCOURED    ERRCSS: 
ARICA    flCC£L  :      1141.46 

REGEESSICN    MODEL:       294.42 
CCMEINEE    MODEL       :       412.06 


ROC1    *EAN    SQUARED    EEECBS    (BMSE) 
ARIHA    MCEEL  :  16.90 

REGEESSICN   MODEL:  9.93 

CCBE3NEE    MODEL       :  10.15 


REGRESS 
63.3    (♦07. 3%) 
♦05.7* 


63.4 
67.0 

64.6 
74.9 


(♦19.65? 
♦31.851 
-00.11) 


CCMEINfD 
68.0     (  +  15.8%) 
5S.8    (-00.3%' 
68.3    (♦22.0X 

57.7  (♦17. 6X 

64.8  (-13.65* 


HHEFE    EMSE   =  /Z<V~    ^r  )2 


(r   -    1) 


7r  -  actual  value 


YT  -  predicted  value 

n   *  5  (cumber  of  ctserva ticns) 

the  ability  to  relate  sudden  shifts  in  the  dependent  vari- 
able tc  measures  cf  unemployment  as  exhibited  by  the 
predicted  values  for  September  1983.  The  ARIMA  process  of 
the  combined  model  has  a  dampening  tendency  on  the 
predictiens  when  there  are  sudden  shifts  in  the  variables. 
The  regression  process  alone  would  then  become  the  pieferred 
model  during  a  period  of  economic  fluctuation  because  of  the 
exhibited  superior  ability  to  capture  these  sudden  shifts  in 
accessions. 

Ire  difficulty  in  forecasting  vith  the  multiple  regres- 
sion model  and  the  combined  model  is  that  one  must  also 
forecast  future  values  of  the  change  in  percent  of  unemploy- 
ment, the  variable  DEITAUN.    Even  with  forecasted  values  of 
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EELIACN,  the  forecast  horizon  is  limited  to  one  month  due  to 
the  cce  mccth  lag  for  unemployment, the  variable  UNE£PID, 
unless  future  values  for  that  variable  are  also  forecasted, 
for  any  period  beycrd  one  month,  the  forecast  for  cet 
contracts  signed  thee  deperds  on  the  accuracy  of  these  two 
additional  forecasts,  decreasing  the  face  validity  of  the 
prediction  due  to  coirpounding  negative  effects.  The  attrac- 
tiveness of  the  Eox-Jenkins  approach  is  that  only  the  values 
cf  the  tine  series  are  taken  into  consideration  in  the  fore- 
casting process.  However,  the  basic  assumption  of  continuity 
in  the  pattern  of  tie  data  must  hold  generally  true  if  the 
results  are  to  be  of  value. 
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VI.  CCNCIOSIONS 

A.   SCflflJil 

Itree  mcdels  utilizing  two  distinct  analytical  methodol- 
ogies have  teen  preserted.  The  Box-Jenkins  methodology  uas 
applied  to  a  monthly  time  series  of  net  enlistments  iE  the 
Albuguergue,  New  Mexico  recruiting  district  and  a  todel  of 
the  underlying  data  patterns  developed.  A  multiple  regres- 
sion acdel  atteaptirg  to  examine  causal  relationships  has 
derived  rased  on  intuition  and  the  statistical  significance 
cf  the  independant  variables.  A  combined  time  series/causal 
model  was  developed  treating  the  residuals  from  the  regres- 
sion lodel  as  an  original  time  series  and  applying  the 
Eox-Jenkins  technique  to  them.  Forecasts  were  conducted  with 
the  three  mcdels  and  the  results  examined. 

A  comparison  of  the  root  mean  squared  error  of  the  resi- 
duals cf  the  three  models  indicated  that  the  multiple 
regression  model  was  superior  to  both  the  Box-Jenkios  model 
and  the  combined  model.  Accuracy  favored  the  multiple 
regression  approach,  however  the  need  to  develop  forecasts 
for  two  principle  irdependent  variables  limit  its'  useful- 
cess  in  forecasting.  A  need  to  forecast  extended  periods 
would  favor  the  Box-Jenkins  approach  since  there  does  cot 
exist  a  requirement  to  include  the  independent  variables  of 
the  multiple  regressicn  model.  Both  the  multiple  regression 
model  and  the  Box-Jenkins  methodology  can  serve  useful 
purposes  when  the  inherent  limitations  of  each  are  taken 
into  consideration. 
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E.   1EELICATI0IS  OF  TEE  BES0I1S 

lh€  multiple  regression  results  suggested  three  vari- 
ables were  primarily  responsible  for  the  variaticr.  in  the 
ret  number  cf  contracts  signed.  These  were  the  ordinal  value 
of  unemt. lcyment,  the  change  in  percent  unemployed  anc  the 
military  /  civilian  jay  ratio.  Of  these  three  variables, 
the  Eepartment  of  Defense  (LCD)  exerts  only  a  partial  influ- 
ence ever  the  military/civilian  pay  ratio.  As  alluded  to 
previcusly,  no  single  measure  of  the  military/civiliac  pay 
ratic  car  be  justified  as  a  "best"  measure  of  this  relaticn- 
ship.  Tie  one  presetted  in  this  study  was  felt  tc  be  an 
adeguate  estimation  cf  how  young  men  making  an  occupational 
choice  view  the  relationship  between  military  and  civilian 
compensation.  This  ratio  did  not  take  into  account  such 
forms  cf  compensation  as  medical  benefits,  enlistment 
bonuses,  cemmisary  aid  exchange  privileges,  special  fays, 
and  ether  benefits.  Each  individual  places  different  values 
on  these  additional  forms  of  compensation  depending  on 
his/her  cwn  needs  and  desires.  It  may  be  that  these  ether 
forms  cf  cempensatior  provide  the  marginal  benefit  that  sway 
a  majority  cf  enlistees  towards  an  occupation  in  the  mili- 
tary. The  effects  of  these  additional  forms  cf  compensation 
have  t€€L  the  focal  feint  fcr  many  studies  and  their  effects 
deserve  ccntinued  research. 

As  fcr  unemployment  and  the  change  in  percent  unem- 
ployed, little  can  be  done  by  DOD  to  influence  these  vari- 
ables. Indeed,  it  is  unreasonable  to  assume  that  anyene 
would  desire  to  place  the  economy  of  the  nation  in  ceril 
simply  tc  ease  recruiting  short  falls.  However,  it  is  clear 
that  general  economic  factors  play  a  pivotal  role  in  the 
cccupaticnal  choice  crccess  and  that  the  effects  of  changes 
in  the  eccncmic  indicators  can  have  an  immediate  impact.  A 
sudden  increase  in  civilian  job  market  opportunities  for  the 
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target  cchcrt  would  have  an  imaediate  and  adverse  impact  en 
Navy  recruiting.  During  periods  of  general  economic  improve- 
ment, acre  resources  will  re  required  to  obtain  siailar 
rumrers  of  high  quality  enlistees  than  during  a  pericd  of 
general  economic  decline. 

Remaining  unexplained  in  the  analysis  is  the  shift  in 
the  time  series  from  apparent  seasonality  over  the  first  two 
thirds  of  the  series  to  apparent  nonseasonality  ever  the 
last  third  of  the  series.  Seasonality  in  recruiting  with 
peak  mortis  in  August  and  January  can  most  easily  be 
explained  ry  the  schcel  year.  Intuitively,  a  young  man  would 
not  desire  to  graduate  from  high  school  in  June  and  directly 
enter  lilitary  service.  Or  the  average,  one  would  expect 
him  tc  desire  to  remain  among  his  friends  for  the  Sumner 
months,  entering  the  service  at  the  end  of  Summer  when 
friends  either  return  to  school  or  enter  the  job  market.  The 
peak  month  of  January  can  be  intuitively  explained  by  the 
potential  recruits  desire  to  remain  among  family  and  friends 
during  the  holiday  season  prior  to  entering  the  service, 
fihat  is  not  readily  ejplained  is  the  virtual  absence  of  this 
seasonality  over  the  last  third  of  the  time  series, 
fiememhering  that  the  dependent  variable  is  contracts  signed 
and  not  accessions,  what  may  explain  the  change  in  the  data 
pattern  is  a  change  in  the  gucta  system.  Utilizing  the  CEP 
pool,  the  potential  recruit  is  currently  able  to  sign  a 
contract  with  the  Navy  and  delay  entry  into  the  service  for 
a  period  of  up  to  one  year.  This  may  have  the  effect  of 
dampening  the  seasonal  pattern  exhibited  by  the  first  two 
thirds  of  the  time  series.  However,  this  is  pure  conjecture 
as  there  was  insufficient  data  available  to  conduct  further 
analysis  or  draw  any  definite  conclusions. 
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C.   BICCHflEHDATICNS  JCB  FOTOBE  STUDY 

The  ncdels  presented  in  this  study  should  be  refined  and 
updated-  The  60  data  point  time  series  utilized  is  tco  brief 
to  accurately  depict  cyclic  flucuations  in  the  data  set.  Ihe 
time  series  should  te  expanded  as  additional  data  points 
become  available.  Estimates  from  models  developed  from  time 
series  cf  adequate  length,  in  excess  of  100  data  feints, 
could  picve  beneficial  in  allocating  quotas  and  recruiting 
resources.  Begressicr  analysis,  examining  causal  relation- 
ships en  a  recruiting  district  level,  should  continue  in 
order  that  a  better  understanding  of  the  effects  that  local 
econcnics  have  on  the  supply  of  potential  enlistees  can  be 
obtained.  Cne  area  ci  potential  intrest  may  be  the  applica- 
tion cf  leading  indicator  models,  an  extension  of  the 
Eox-Jenkins  methodology,  to  the  problem  of  predicting  the 
supply  of  recruits. 

Cue  tc  lack  cf  sufficient  available  data,  several  refin- 
ments  in  this  study  which  the  author  feels  are  important 
could  not  be  made.  first,  in  the  area  of  unemployment,  an 
accurate  breakdown  by  sex,  age,  and  ethnic  group  by 
recruiting  district  would  have  been  of  significant  benefit. 
lo  make  an  appreciable  improvement  in  the  understanding  of 
the  interaction  of  local  economic  conditions  on  the  supply 
cf  potential  enlistees  this  data  will  be  necessary  for 
future  studies.  Sose  sort  of  working  agreement  with  the 
Eepartment  cf  Labor;  Bureau  of  Labor  Statistics  would  be 
necessary  in  order  tc  obtain  the  data  in  a  useful  fen.  A 
restructing  of  the  recruiting  districts  would  make  data 
gathering  easier.  A  realignment  along  state  boundries  would 
sake  close  coordination  with  state  governments  mere  readily 
avialable.  There  could  be  a  cooperative  effort  between  the 
state  unemployment  office  and  tne  recruiting  district. 
Although  seme   states   would  be   reluctant   to   act   as   a 
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Cf  particular  benefit  would  be  a  regional  study  encom- 
passing accessions  icto  all  four  branches  of  the  nilitary 
and  the  Coast  Guard.  A  study  of  this  type  would  permit  the 
examiration  of  the  interaction  of  the  different  recruiting 
efforts.  1  his  would  assist  in  determining  if  an  increase  in 
the  recruiting  efforts  of  any  one  service  served  to  increase 
the  tctal  number  of  accessicns,  or  merely  drew  away  poten- 
tial recruits  frcm  one  of  the  other  services.  Additicrally, 
by  cciparing  the  regional  economy  to  total  military  acces- 
sions for  the  region,  a  clearer  understanding  of  the  effects 
cf  changing  econcmic  conditiens  could  be  obtained. 
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PfilMEB  CN  TIHE  SERIES    ANALYSIS 

A.   UflE  SEBIES  AHAIISIS 

Eiceft  hiiere  otherwise  noted,  this  appendix  is  tased  on 
the  works  of  5.  tfakridakis  and  S.C.  Wheelwright.  A  time 
series  is  a  collection  of  observations  made  sequentially 
in  tiie.  Examples  occur  in  a  variety  of  fields  from  physics 
to  physiclcgy  and  methods  of  analyzing  time  series  consti- 
tute an  important  area  of  statistics.  Statistical  techniques 
for  analyzing  time  sexies  range  from  the  straight  forward  to 
the  very  sophisticated.  However,  traditional  means  of  time 
series  aralysis  are  primarily  concerned  with  decomposing  a 
series  intc  the  basic  components  of  trend,  seasonality, 
cyclic  chances  and  other  fluctuations. 

The  first  step  in  analyzing  a  time  series  is  to  plot  the 
observations  against  time.  Figure  A. 1  is  a  time  series  plot 
of  the  sales  figures  for  a  certain  engineering  company  over 
successive  months  [fief.  11]*  This  plot  can  be  used  to 
obtain  simple  descriptive  measures  of  the  main  properties  of 
the  series.  For  example,  locking  at  Figure  A. 1,  it  can  be 
seen  that  there  is  a  regular  seasonal  effect  with  sales 
'high*  ir  the  winter  and  'low'  in  the  summer.  The  graph 
suggests  an  upward  trend  in  annual  sales.  For  seme  time 
series,  the  variation  is  dominated  by  such  'obvious' 
features,  and  a  fairly  simple  model  may  be  perfectly 
adequate  to  describe  the  variation  in  the  time  series,  for 
ether  time  series,  more  sophisticated  techniques  will  be 
required  tc  provide  adequate  analysis. 

When  observations  are  taken  on  two  or  more  variables,  it 
may  be  possible   to  use  the  variation  in  one   time  series  to 


Figure  A.1    Hcnthly  Sales. 

explain  the  variation  in  another  time  series.  This  nay  lead 
to  a  deeper  and erst aiding  of  the  underlying  causal  relation- 
ships in  a  process  and  permit  accurate  forecasts  of  future 
values  of  the  variable  of  interest.  This  may  in  turn  aid  in 
the  ccrticl  of  that  future  value. 

The  ability  of  a  given  technique  to  forecast  effectively 
in  a  specific  situation  depends  largely  on  accurately  iden- 
tifying the  patterns  underlying  the  data  and  selecting  the 
proper  technique  to  handle  them.  All  time  series  analysis 
technigues  assume  that  the  patterns  underlying  the  data  are 
constant  over  two  periods;  the  period  over  which  the  data 
uas  collected,  and  the  period  of  the  forecast.  However, 
different  technigues  are  available  to  fit  a  vide  variety  of 
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situations.      The     factors   that      need   to      be  considered      when 
selecting   a   model   arc   as  follows: 

1.  The   time  horizon   for  decision   making 

2.  Ihe  pattern  of    tie   data 

3.  Type    ci  model   desired:    tine   series   or   causal 

4.  lie   value    of   the  forecast 

5.  Ihe   accuracy   that  is  reguired 

6.  Ihe   complexity    that  can  he   tolerated 

7.  Ihe   availability  of   historical   data   [fief.    8] 

E.       TEE    EOI-JEIKIIS    EETBODOIOGI 

1.      Ihe   Advantages   of   Box-Jenkins 

Ihe  Box-Jenkins  methodology  is  an  efficient  and 
practical  procedure  fcr  handling  time  series  and  ether  fore- 
casting situations  in  which  a  variety  of  complex  patterns 
exist.  This  methodolcgy  can  handle  complex  patterns  of  data 
using  a  relatively  well  specified  set  of  rules  that  dc  net 
require  that  a  model  re  chosen  on  the  basis  of  an  analyst's 
experience,    intuitive  ability   or   theory. 

Eox-Jenkins  is  the  most  general  method  of 
apprcaching  time-series  forecasting.  There  is  nc  need  to 
ass u He  initially  some  fixed  pattern  to  date.  The  approach 
begins  by  assuming  a  tentative  pattern  that  is  fitted  tc  the 
data  so  that  the  error  will  be  minimized.  It  additicially 
provides  explicit  information  to  enable  the  user  to  judge 
whether  the  pattern  tentatively  assumed  is  correct  for  the 
situation  under  study.  If  so,  the  forecast  can  be  developed 
directly  and,  if  net,  the  Eox-Jenkins  approach  provides 
further  clues  for  identifying  the  correct  pattern.  This 
procedure  allows  the  user  to  arrive  at  a  forecasting  model 
that  achieves  optimization  in  terms  of  the  basic  pattern  and 
ninisizes  the  forecasting  error.  The  user  is  also  supplied 
with  statistical  information  on  the  accuracy  of  the  fore- 
casts. 
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2  •   Ihe  General  Approach 

In  describing  their  abroach,  Box  and  Jenkins  have 
developed  the  scheaatic  diagram  shown  in  figure  A.  2 
£fief.  12].  This  abroach  divides  the  problem  intc  three 
stages.  In  stage  one,  a  specific  model  is  suggested  as  the 
forecasting  method  best  suited  to  that  situation.  In  stage 
two,  that  suggested  acdel  is  fitted  to  the  historical  (time 
series)  data  and  the  suggested  model  is  checked  to  determine 
its  adequacy.  If  the  model  is  not  adequate,  the  analyst 
returns  to  stage  one,  and  an  alternative  model  is  propcsed. 
fihen  an  adequate  model  is  identified,  the  development  cf  a 
forecast  fcr  some  future  time  period  is  conducted  in  stage 
three. 

2.   She  Hole  of  autocorrelation 

An  autocorrelation  ccefficient  is  similar  to  a 
correlation  coefficient  except  that  it  describes  the  associ- 
ation amcng  values  cf  the  saie  variable  but  at  different 
time  periods.  The  autocorrelation  among  successive  values  of 
the  data  is  a  key  tool  in  identifying  the  most  appropriate 
model  corresponding  to  the  basic  pattern  of  the  data. 
Correlation  implies  nothing  about  a  change  in  one  variable 
causing  a  change  in  the  other  variable. 

1c  further  deionstrate,  we  can  construct  an  artifi- 
cial variable  from  another  variable  by  changing  the  time 
origin  cf  the  data.  For  example,  dropping  the  first  value 
cf  'A'  and  letting  the  second  value  be  the  starting  value  of 
'5*,  See  figure  A. 3  .  In  this  example  variables  'A'  and  *Bf 
can  be  treated  as  two  separate  and  distinct  variables  even 
though  they  came  from  the  same  data  set.  In  this  example, 
•B'  has  a  time  lac  of  one  period  from  •A1.  This  same 
approach  can  be  expanded  to  create  additional  data  sets 
utili2ing  as  many  time  lags  is  as  deemed  necessary  for  iden- 
tifying the  appropriate  model. 
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Figure   A.  2        Box-Jenkins  Method. 
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Figure  A. 3    lagged  Variables. 

He  can  consider  variables  •A1  and  •B*  as  two  vari- 
ables and  calculate  their  ccrrelation  coefficients.  A  coef- 
ficient cf  .80  between  •A*  and  '  Bf,  for  example,  would  imply 
that  successive  values  of  *A'  with  one  period  (lag)  between 
them  are  positively  correlated  with  each  other  and  teed  to 
move  in  the  same  direction.  Siiilarly,  a  coefficient  of  -.70 
between  'A'  and  'B*  would  indicate  that  successive  values  of 
'A'  lagged  ence  are  negatively  correlated  and  tend  to  move 
in  oppesite  direc tiers.  However,  since  variable  *E'  is 
actually  derived  froa  variable  'A',  such  an  association  is 
called  arte- (self-)  ccrrelation. 

Autccorrelaticns  provide  significant  clues  about  the 
underlying  pattern  cf  a  data  set.  In  a  set  of  ccapletely 
random  data  the  autccorrelations  among  successive  values 
will  be  clcse  to  zeic,  but  data  values  of  strong  seasonal 
and/or  cyclical  character  will  be  highly  correlated, 
figure  A. 4  £Bef.  13]  for  instance,  presents  the  autccorrela- 
tions cf  different  time  lags  of  monthly  temperature  in 
Paris.  These  autoccrrelations  reveal  a  strong  seasonal 
pattern. 
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It  is  this  type  of  information  gained  from  autocor- 
relations that  can  re  utilized  by  the  Box-Jenkins  approach 
to  identify  the  optiaal  forecasting  model.  Of  particular 
interest  is  that  th€  analyst  reeds  to  know  nothing  of  the 
data  set  or  its  patterns  to  ottain  its  autocorrelaticn  coef- 
ficierts.  Ihese  coefficients  can  be  used  to  describe  the 
data  set  and  assist  in  identifying  a  tentative  model  to  be 
fitted  tc  the  data. 

fchat  Figure  A. 4  implies  is  that  temperature  of 
months  twelve  periods  apart  are  highly  correlated.  Although 
this  is  rasically  intuitive,  if  this  were  not  previously 
known  the  autocorrelations  would  be  of  significant  benefit 
in  suggesting  an  appropriate  model  to  describe  the  data 
pattern  £Bef.  13 J. 

**•   £ox-Jenkins  Hcdel  Typ.es 

Ifce  Box-Jenkins  methodology  postulates  three  general 
classes  cf  models  that  can  generally  describe  any  type  or 
pattern  cf  time  series  data: 

a.  Autcregressive    <£B) 

b.  Moving   Average    (flA) 

c.  Mixed   Autoregressive-flcving    Average    (ARMA) 

An   autoregressive  model    (Afl)    has   the   form: 


*r    S<2>ST.(  +9iYr.x+9aV^---+9,Xr^+er  <€SD    A'1> 


where  Y7 is  the  independent  variable,  high  guality  recruits, 
and  XT.|  #Xr-i  t^r-i  #•••#*!--*  are  "independent"  variatles. 
In  this  case  the  independent  variables  are  values  cf  the 
same  variable,  but  of  previous  periods  (t- 1,t-2, t-3,. . . t-p) . 
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finally  e  is  the  error  or  residual  term  that  represents 
randca  events  that  are  not  explained  by  the  model.  ihe 
residuals  (errors)  of  the  model  should  be  white  noise,  that 
is  they  should  have  a  mean  of  zero,  a  constant  variance  and 
he  unccrrelated  ever  time. 

Ihe  model  described  is   autoregressive  because  it  is 
similar  tc  the  regression  equation 


Y=a*r,  I,  ♦t2Xz*t3X3*...+bf  Xf  *€r  (egn  A. 2) 

except  that: 

and  thus  the  independent  variables  are  only  lagged  values  of 
the  dependant  variable  with  tine  lags  of  1,  2,  3,...,  p 
periods.  Ihe  autoregxession  model  is  similar  in  function  and 
form  tc  the  regression  equation.  However  it  is  net  appro- 
priate tc  describe  all  possible  underlying  data  patterns  in 
time  series  analysis.  Thus,  the  Box-Jenkins  approach  also 
considers  tuo  other  classes  cf  models. 

Ihe  Moving  Average  (HA)  model  has  the  form: 

YT  =  €T-0ier.|  -0^^  -0,er.s  -...-^e  (egn  A. 3) 


where  eT  is  the  errcr  or  residual  and  er_(  ,  eT.v  ,er-3 
,...,eT.<   are  previous     values   of      the   error.  This   iijlies 

that  the  dependant  variable  Yr  ,  depends  on  previous  values 
cf  the  errcr  term  (er_,  #er.t  #er„,  ,...,er.«)  rather  than  on 
the      variable  itself.        The      future      value   of      high      guality 
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enlistments  could  be  predicted  by  utilizing  the  errcr  of 
each   of   several    past   predictions.  In    the  same    manner   that 

autocorrelation  among  successive  values  of  Y  can  te  exam- 
ined, the  autocorrelation  among  the  successive  values  of  the 
residuals   can  also   be   examined. 

Ihe  third  class  of  model  considered  by  the 
Eox-Jenkins  methcdolcgy  is  a  mixed  model.  The  best  descrip- 
tion of  the  pattern  of  data  may  be  provided  by  a  mixed 
process  cf  Afi  and  MA  elements.  Ihe  general  mixed  mcdel  has 
the    form: 


IT-9(Xr_,  ♦0JT.4*.  -^r-f  ♦eT-©.er^-e«r.I-  -'Sfr-fc  (€9D    A-U) 


It  is  evident  that  this  model  is  simply  the  AR  and  HA  mcdels 
combined.  It  suggests  that  future  values  of  high  guality 
recruits  depend  en  bcth  past  values  of  high  guality  enlist- 
ments and  the  errors  between  forecasts  and  actual  values. 

5.   Identifying  a  Tentative  Model 

It  is  possible  to  suggest  some  specific  ARMA  (p,g) 
model  by  examining  the  autocorrelation  coefficients  and  a 
similar  set  of  parameters,  the  partial  autocorrelation  coef- 
ficients. Partial  autccorrelaticn  coefficients  are  analcgcus 
to  autccorrelaticn  coefficients  in  that  they  indicate  the 
relationship  of  the  values  in  a  time  series  to  various 
lagged  values  of  the  same  series.  However,  they  differ  ficm 
autocorrelations  in  that  they  are  computed  for  each  time  lag 
after  reioving  the  effect  of  all  other  time  lags  in  the 
given  tise  lag  and  on  the  original  series.  In  essence,  they 
show  the  relationship  that  exists  for  varying  time  lags 
£Bef.  8]. 
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Ccmtinations  cf  autocorrelation  and  partial  autocor- 
relation coefficients,  in  the  absence  of  randomness,  can 
reveal  tie  exact  ARMA  model.  Randomness  complicates  the  task 
tecause  it  influences  the  values  of  the  autocorrelation 
coefficients  and  may  cause  them  to  deviate  from  their  "true" 
patterr.  This  makes  model  identification  more  difficult,  tut 
generally  there  is  ercugh  information  available  to  select  an 
appropriate  model.  Even  if  there  is  a  mistake  at  this  point 
in  the  aralysis,  it  hill  be  found  when  a  test  of  the  model 
is  conducted.  Figures  A. 5  through  A. 9  demonstrate  the 
behavior  cf  the  theoretical  autocorrelations  (ACs)  and 
partial  autocorrelations  (PACs)  for  each  model,  where  "HA" 
denotes  a  loving  average  model,  "SUA"  denotes  a  seasonal 
Hovirg  average  model,  "AR"  denotes  an  autoregressive  model 
and  "SJB"  denotes  a  seasonal  autoregressive  model. 
£Bef-  10] 

Thus  in  order  to  forecast  a  time  series,  the  auto- 
correlations and  partial  autocorrelations  must  be  computed 
first.  Sometimes  the  pattern  of  the  computed  autocorrela- 
tions and  partial  autocorrelations  can  be  easily  classified 
as  one  of  the  theoretical  ones  described  below.  The  identi- 
fication cf  an  appropriate  model  is  then  relatively  easy  . 
Other  tines,  some  free  association  and  intuitive  judgement 
is  reguired  to  be  able  to  infer  a  pattern  from  the  autocor- 
relations, or  mere  than  one  pattern  may  be  suggested.  In 
such  instances,  a  choice  can  be  made  based  on  goodness  of 
fit  and  nuaber  cf  parameters  involved.  Once  a  tentative 
model  has  been  made,  the  next  step  is  to  estimate  the  param- 
eters. 

£•   Estimating  tie  Parameters 

Assuming  that  the  tentative  model  is  an  ARHA  (1/1) 
its'  mathematical  form  is 
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Yr  =  <2>}r.t   +er  -  €>,%_,  <e<3B  A. 5) 

in  order  tc  utilize  this  model,  the  value  of  CD  and  Q  must 
be  estimated.  The  general  approach  is  to  start  with  seme 
estimated  initial  values  for  CD  and  0  and  then  modify  them  by 
small  steps  while  observing  the  sum  of  the  squared  errors 
<SSE) .  Ihis  allows  tie  direction  of  change  in  CD  and  0  that 
result  in  the  smallest  SSE  to  be  determined.  Eventually, 
the  ©acd  0  corresponding  to  the  minimum  SSE  are  found  and 
used  as  the  final  estimates  for  the  model.  The  next  step  is 
to  test  the  adequacy  cf  the  model. 

7 .   Testing  the  Ecdel's  Adequacy 

If  the  lodel  is  adequate,  the  residuals  (differ- 
ences) between  the  tine  series  values  and  those  estimated  by 
the  scdel  mest  be  random  (white  noise).  The  autocorrelation 
coefficients  for  these  residuals  are  examined.  If  none  of 
the  autocorrelation  coefficients  of  the  residuals  are 
significantly  different  from  zero  (plus  or  minus  twe  stan- 
dard errors),  the  errors  are  assumed  to  be  white  noise  and 
the  model  is  adequate.  If  the  autocorrelations  are  not 
random,  the  analyst  must  return  to  step  two  and  select 
another  model. 
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Figure  A. 8   HA  (2)  flodel  ACs  and  PACs. 
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